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Ozone Spectra 


A survey spectrum of ozone between 1850 and 4500 cm -1 is shown in 
Fig. 1. The sample contained approximately 10 atm cm O3 at 750 Torr total 
pressure (0 2 + O3) and the spectral resolution is about 0.5 cm -1 . At this 
pressure and resolution few of the details in the bands can be identified. 

In addition to bands of ozone, there is also absorption by IfeO and C0 2 in 
the optical path outside the absorption cell, and absorption by CO pro- 
duced by reactions of ozone with carbon compounds in the cell and CH4, an 
impurity in the oxygen. 

Absorption by ozone occurs from 1850 to 2250 cm -1 . This is due to 
the wings of the v x + v 2 band near 1790 cm" 1 , by the v x + v 3 band at 2110 
cm" 1 , and the 2v x band near 2200 cm" 1 . The v 2 + 2v 3 and v x + v 2 + v 3 bands 
are seen between 2700 and 2800 cm" 1 . There is also absorption by O3 be- 
tween 2950 and 3050 cm" 1 and weaker features near 3200 and 4030 cm" 1 . 

High resolution spectra of these bands will be obtained in the next several 
months and an atlas showing their appearance for different sample condi- 
tions will be prepared together with a listing of the line positions. Es- 
timates of the line intensities and widths will be obtained in future work. 

The spectra in Fig. 1 and other studies have shown that bands of 
ozone consist of densely packed series of lines which are incompletely 
resolved even with a spectrometer of 0.04 cm" 1 resolution. The blending 
of the lines makes it difficult to measure the line positions and even 
more difficult to estimate the line intensities and widths. The conven- 
tional methods of estimating these latter parameters from measurements of . 
the equivalent widths fail and other methods of analysis are desirable. 

A very powerful method of extracting line parameter information from 
spe ctra has recently been develop ed by our group. This cons is ts of n on- 
li near , least- squares , fitting of the observed sp ectra l s ig nals and the 
results obtained are usually superior to those obtained by other techniques. 
Even this method cannot be successfully applied to single lines when the 
number of signal values in the vicinity of the line is small. Thus further 
improvements in the curve fitting method are required. Some of these have 
been investigated during the past few months. Several approaches have 
heen considered and a review of our recent work is given below. 

Laboratory spectra are usually obtained from samples containing ar- 
bitrary amounts of the absorbing gas and at arbitrary total pressures. 

Very low pressures are usually used if the highest resolution of the spec- 
tral details is required. This Improvement is due to the decrease in the 
line widths with pressure. As the pressure is decreased, the line shape 
changes from the Lorentz shape to the Voigt shape. When the line widths * 
are smaller than the spectral resolution, it becomes increasingly difficult 
to retrieve the line intensities and widths. 

These considerations suggest that, for a given spectral line, observed 
with an instrument whose performance characteristics are known, there are 
some optimum sample conditions which allow the best estimates of the line 
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parameters to be obtained. The estimation of" these optimum conditions is 
a problem of experimental design. Although the principles of experiment 
design have been applied to many types of investigations, there have been 
few applications of the s_e concepts to the design of spectroscopic experi- 
ments. 

Dr. E. Niple has investigated the types of information contained in 
spectra of Lorentz shaped lines obtained with instruments of finite reso- 
lution by using the statistical methods developed by, for example, Fisher 
and Kullback. 

As a result of this work, two articles have been prepared and copies 
are enclosed with this report. The first of these, "Information Measures 
in Nonlinear Experimental Design, " describes the general method used- to 
identify the optimum experimental conditions required to obtain the best 
estimates of a desired parameter. The second "Information in Sp.ectra of 
Collision Broadened Absorption Lines" describes the application to Lorentz 
shaped lines. The approaches described in these papers can Jbe applied to 
many other spectroscopic problems. 

More recently. Dr. Niple has begun to use these methods to investigate 
the types of information which can he expected from spectra of the strato- 
sphere. The experiment chosen for analysis was that of occultation solar 
spectra obtained with an instrument of moderately high resolution. Some’ 
preliminary results are described in an interim report currently being 
prepared. 

These approaches allow optimum experimental designs to be evaluated 
and also the nature of the laboratory spectroscopic data required for the 
analyses to be quantitatively described. 


Concurrently with these investigations, analyses of the type of in- 
formation which can he retrieved from spectra of Voigt, shaped lines have 
been undertaken. The attached article "Least Squares Analysis of Voigt 
Shaped Lines" has been accepted by JQ.SRT. This article shows that a number 
of parameters of Voigt lines can be estimated provided a sufficient number 
of signal values are available for analysis. 

These investigations have shown that it is possible to define optimum 
configurations for spectroscopic measurements and that all the important 
line parameters can be estimated from a single spectrum of an absorption 
line. They also indicate that it is unlikely that all the line information 
required for spectroscopic investigations of the stratosphere can be ob- 
tained by a line by line analysis of ozone spectra obtained with an instru- 
ment of 0.04 cm -1 resolution. 

However, the aim of many analyses of high resolution spectra of gases 
is to describe the individual line characteristics by models containing 
far fewer adjustable parameters than the number of lines observed. Although 
the values of these adjustable parameters are usually obtained by fitting 
the models to the measurements of individual lines, it is clear that a 
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considerable reduction in the time and effort required can occur if the 
parameter values are obtained by simultaneous analyses of all the lines 
in the band. From this viewpoint it is also clear that, even though 
there may not be sufficient information in a given series of spectra to 
allow estimates of the individual line parameters to be obt ain ed separate^ 
ly, estimates of the important band parameters can still be obtained by 
collecting all the information concerning them from all of the lines si- 
multaneously. 

This approach has been tested by analyzing low resolution (0.1 cm -1 ), 
rather noisy spectra of HC1 and CO. Some initial results are described 
in the attached article "Band Analysis by Spectral Curve Fitt ing . " The 
results are very encouraging. 

These, and other investigations by our group were described in a 
series of presentations at the Thirty-Fourth Symposium on Molecular Spec- 
troscopy, The Ohio State University, June 1979. Abstracts of these tails 
■are attached. 

The results obtained thus far indicate that a substantial increase 
in the precision with which band parameter values can be estimated can be 
achieved by analyzing higher-resolution, less-noisy spectra. Bands of 
HC1 and CO consist of well separated lines, in marked contrast to the 
densely packed lines in bands of 0 3 . Thus the extension of these whole 
hand analysis techniques to ozone spectra is a formidable undertaking. 

Me intend to proceed in this direction by analysing bands of U 2 0 and C0 2 , 
before tackling the more complicated problems of ozone. The band models 
for these molecules are considerably less complicated than those of ozone. 
However, the presence of overlapping bands of isotopic molecules and other 
hot bands will serve as a crucial test of this method. 
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ABSTRACT 


Some different approaches to the problem of designing 
experiments which estimate the parameters of nonlinear models 
are discussed. The assumption in these approaches that the 
information in a set of data can be represented by a scalar 
is criticized, and the nonscalar discrimination information 
is proposed as the proper measure to use. The two-step decay 
example in Box and Lucas (1959) is used to illustrate the main 


points of the discussion. 



Introduction 


Many experiments involve estimating the parameters of nonlinear 
models. The purpose of this paper is to collect together and organize 
some different approaches to the design problem for such experiments. 
The basis for this organization is what Rosenkrantz (1970, p.58) has 
called "the obvious analogy" between experimentation, or measurement 
processes, and communication processes. It will be shown that this 
analogy allows the design problem to be treated as an Information 
Theory (IT) problem. Since IT is interdisciplinary in nature, the 
results can be applied to many different fields, in our case, experi- 
mental physics and chemistry. Most of the examples discussed below are 
* 

from these areas. 

There have been several previous attempts to pursue this analogy.* 
We shall not review these here because they have not found wide spread 
use among experimental designers. We feel that this is due, in part, 
to several inappropriate fundamental assumptions carried over from 
Communication Theory without sufficient justification. (By Communi- 
cation Theory we mean that impressive body of work which followed from 
the papers of Shannon and Wiener.) Principal among these assumptions 
is that the formal quantity which represents the information in a set 
of data has to be scalar. Equally unfortunate is the neglect in 
these approaches of what may be called (with apologies to Einstein) the 
Relativity Principle of Belief, which states that the information in a 
set of data is independent of the beliefs and prior expectations of the 
experimenter . 


*Perhaps the most well known is that of Brillouin (1956) . Many more 
can be found in the journal Measurement Techniques , 


- 1 - 



We do not need to propose a new IT approach to the nonlinear 
design problem. Instead, some of the non-IT approaches currently 
in use are reviewed and it is shown that these can be restated in 
terms of an existing IT formulation whose roots lie in statistics 
instead of Communication Theory. 


The Box Lucas Approach 

Perhaps the most theoretically developed of the non- IT approaches 
is that introduced in the well known article by Box and Lucas (1959) 
and extended by Draper and Hunter (1966). (See St. John and Drap er 
(1975) for a recent review of the use of this technique.) The general 

outline of the problem they discussed is (Box and Lucas (1959; p.77)) 
"...some response n is a known function 

n « f(5 1 ,? 2 ,...,5 lc ,02» 0 2’. , *> e p) = (1) 

of k variables whose levels are denoted by the elements 
of the vector £ and of p parameters 0^, 

...,0 ,...,6 elements of the vector _§_... The problem 
here considered is that of selecting a programme of trials 
[i.e., a set of experiments] such that they may be expected 
to provide results from which the p parameters can be 
estimated with high accuracy. In general the experimental 
programme may be defined by an Nxk matrix D = called 

the design matrix. The ut“ row £! of this matrix with 
elements £ u »••.»?£ ,...,5^ provides the levels of the 
k variables at whicR the response is to be observed in the 
u th trial.' 1 

We assume that this problem may be reformulated as : how can the 

most information regarding the parameters be obtained? Box and Lucas 
appear to share this assumption because, when they consider the in- 
clusion of experimental costs, they write (p.80), "One could then 
attempt to find the design giving the most information [italics added] 
for a given cost.” In a later article (Box (1970, p.585)) the Box and 
Lucas design criterion (defined below) which is to be maximized for 
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the optimum design, is even referred to as the "information" of a set 
of data. It should be noted, however, that this criterion is not a 
valid information measure, as such are usually defined in IT, since 
it is not additive for independent sets of data. 

fch 

Assume now that y , the observed response for the u set of 


experimental conditions 5 » is such that 


E(y u ) = \ = f (£u;0), 


and 


E (y -n ) (y -n ) 
7 u u -'v v' 


f o’ 2 , u = v 


u,v = 1, . . . ,N, 


( 2 ) 


(3) 


0 , u ^ v 

where a 2 , the variance of the observed responses, is unknown in general 
One estimate for 6^, the true values of the parameters, is the least 

t A 

squares estimate, JL This minimizes the sum of squares 
N 

S - f (JLu;S )} 2 


u— 1 


u 


and has an approximate variance-covariance matrix S_ given by 

S « o 2 {F , F} _1 , (4) 


where 


and 




ru 


- ft . 
0=0 


The Fisher information matrix W, used below, is related to S_ according 
to 

W = S_ _1 = 1/a 2 {F’F}. (5) 

The Box and Lucas technique requires that the design matrix ID be 
chosen so that the determinant |j?'fJ (or equivalently j w[ ) is maximized 
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Thus the determinant of the Fisher information matrix is used as the 
information measure in this approach. Such a design is referred to 
as a D-optimal design (Kiefer and Wolfowitz (1959). 

When p“l, i.e. there is just one unknown parameter, maximizing 
|w[ corresponds to minimizing ct 2 ^, the variance of the parameter 
estimate. This is an obvious choice in this instance; however, it 
is by no means obvious that |wj should be used when p §r 2. 

A Specific Example 

To illustrate the difficulties encountered by using this approach, 
we return to the example discussed by Box and Lucas . This involves ob- 
serving the yield p as a function of the observation time £ of an in- 
termediate species B in the consecutive reaction 

A 9i » B 62 »C, (6) 

G • 

where each — represents a first-order, irreversible decay with 
rate 0j. It may be shown that 

h(C 1 ) - {0 1 /(0 1 -8 2 )Hexp(-0 2 ? 1 ) - exp(-0 1 ? 1 )}. (7) 

If just two values of the yield are to be observed (k=2), the Box and 
Lucas optimum design is obtained from Figure 1. Here the contours of 
|f| are shown as functions of the two observation times and ? 12 ' 

The design criterion is simply 

|F'F | = { |f| } 2 

when k = p. In order to estimate the optimum design, the values of 0^ 
and 8 2 must be estimated. We have used the same values as Box and 
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Lucas : 

0 , = 0.7 and 0 = 0.2. 

1 2 

Figure 1 is similar to Fig. 3, p.83 of their article. The D-optimal 

design occurs at 

5 n = 1.23, g 12 = 6.86 (8) 

with jF’Fj = 0.66. 

For this simple case of two observations and two unknowns, the 
values of 0j and 0 2 are estimated by solving the simultaneous equations 

y l = f 0 ]/ ( e 1 -e 2 )Hexp(- 0 2 5 11 ) - exp(-'0 1 € 11 )}, 

y 2 = {0 1 /(e 1 -0 2 )Hexp(-0 2 ? 12 ) - exp(-0 1 S 12 )>, 

where y l and y^ are the observed yield values. The variances of these 
estimates can be obtained from (4) or from the standard propagation of 
errors formulae (Bevington (1969, p.59)). For example, the variance of 
0 is 

cr 2 0 = (90 1 /3y 1 ) 2 cr 2 y + (30 1 /9y 2 ) 2 o 2 y . (9) 

o n o 

Under the assumptions in (2) and (3), this gives a * = cf = o , and 

y l y 2 

some straight forward algebra verifies that (9) yields the same result 
as (4). 

Another approach to nonlinear design considers these variances as 

the relevant design parameters (see e.g., Hernandez (1978), Kovanic 

(1970), Pass! (1977)). Figures (2) and (3) show the contours of c 2 /o 2 , 

°1 

and a 2 / cr 2 , respectively over the same range of times as Fig. 1. 

0 „ 
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The maxima (corresponding to minimum variances) occur at 

? 11 = 7 - 8 ’ ? 12 ° 1 * 15 » (Fig. 2) (10a) 

and 

*H = 5.9, 5 12 = 3.45. (Fig. 3) (10b) 

Clearly, if only 8^ is to be estimated, the design in (10b) is better 
than that in (8) selected by the Box and Lucas approach. Similarly, 
the design in (10a) is better if 0 alone is to be found. Even if 

both parameters are to be estimated, one may be more important than 

the other. No allowance for this possibility is made in the Box and 
Lucas approach. What, then, justifies the "optimal" nature of (8)? 

Some Other Criteria 

Accord in g to Day (1969, p.110) the usual procedure in linear 
design is to choose a design in which the various parameter estimates 
are uncorrelated. 

The correlation coefficient for the two rates 8 1 and is given 
by 

c = s 12 / v/ s 11 s 12 , 

and the contours of this quantity are shown in Fig. 4. It is seen 
that there exists a range of different uncorrelated designs (C = 0) for 
this example, although nonlinear problems do not always have them 
(Day (1969,p.ll)). Whatever the advantages of such designs may be, 
in this case. Figs. 2, 3, and 4 show that these do not include 
minimum parameter variances. 
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Lees (1970) discusses the use of a different quantity for non- 
linear design. The matrix W is modified according to 

w* - cwc, 

where the diagonal matrix JO is given by 



and C = 0 , r ^ s . 
rs 


W* is then diagonalized and the value of the smallest eigenvalue 

* 

examined. According to Lees (1970, p. 127) "...a small [the eigen- 
value] definitely indicates a near linear dependence [among the 
unknown parameter estimates]." Such a dependence indicates a poor 
design due to the difficulties in numerically inverting the data to 
retrieve the parameter estimates. 

Figure 5 shows the contours of the smallest eigenvalue of W* for 
the decay example. While this quantity may be useful in deciding 
which designs are worst, it is unclear how it could be used to decide 
which one is best. 

Day (1969, p. 110) has suggested several other quantities, among 
them the spectral condition number of W (similar to Fig. 5) and the 
trace of W (shown in Fig. 6), which may be better than |wj . 

It seems to us, however, that the problem lies not in deciding 
which of the scalar measures in Figs. 2-6 to use in place of | wj , 
but rather in determining whether any scalar measure is adequate. 
Figures 2 and 3 offer convincing evidence that no scalar is . 
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As Rivkin (1968, p. 230) has noted, this same problem arises in 
approaches which use scalar Communication Theory measures such as 
the entropy. 

The crucial question which needs to be answered is: what type of mathe- 
matical object (scalar, vector, etc.) should be used to represent the 
information in data ? Most approaches to nonlinear design have assumed 
that a scalar should be used, but this is unacceptable when multiple 
parameters are involved. 

Exploring the entire range of possible types would be a formidable 
task, fortunately, however, there already exists an alternate approach 
which uses a nonscalar measure whose properties have long been known. 

The basics of this were outlined by Fisher (1939) prior to the work of 
Box and Lucas or even the pioneering work of Shannon and Wiener in 
Communication Theory. As extended by Good, Kullback, and others, this 
approach defines an information measure which, for a given set of data, 
is a mathematical operator that takes as input two hypotheses and H 2 
about the nature of the physical system being studied and produces as 
output a number. This number measures the amount of information in the 
data for discriminating between H^' and . 

Details of the Discrimination Information 

Most of this section is taken from Kullback (1959). 

Let each hypothesis IL» i-1,2 be such that it defines a probability 
space ( X , L, }k) composed of a set of elements xeX (called the 
sample space), a collection L of all sets of elements from X > and a 
probability measure y^ over the elements of L. Assume that y^ and 
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P 2 are absolutely continuous with each other and with a third measure 
X. (Two measures a and g are absolutely continuous with each other, 
written as o = g, when there exists no set E £ L such that a(E) = 0 
and g (E) =f 0 or g (E) = 0 and a (E) ^ 0 . ) 

By the Radon-Nikodyn theorem, each hypothesis Ik has associated 
with it a generalized probability density f_^(x) such that 


l*i(E) = Jf ± (-x)dX00 


for all Eel except for sets of measure zero in X. 

If each element x represents a possible set of observed responses 
Hi,. . . ,n n for some experiment, then f^(x) is the probability density 
of observing the particular set x given that Ik is true. When f^(x) 
is considered as a probability over hypotheses with x fixed, it is 
referred to as a likelihood (Box and Tiao (1973, p. 10)) . 

Kullback (1959, pp. 4-5) defines the quantity 

[f i (x) 


I(l:2;x) = log 


f 2 (x) 


(ID 


as the information in x for discrimination in favor of Hj against H 2 . 
From Bayes' theorem (Box and Tiao (1973, p. 10)) (11) is equivalent to 


I(l:2;x) 


log 


' 

pChJx) 

p(h 2 |x) 


log 


P(Hj) 

P(H 2 ) 


( 12 ) 


where P (Ik ) and P (Ik | x) are the prior and posterior probabilities of 

H . . This quantity measures the relative change in the probabilities 

t 

of H 1 and H 2 as a result of observing x. (See Osteyee and Good (1974, 
pp.1-17) for a discussion of the relationship between I(l:2;x) and 
the information measures of Communication Theory.) The expected value 
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of l(l:2;x) , given that H 1 is true, is 

1(1:2;*) = 1(1:2) = Jf i (x)I(l:2;x)dA<x) 

X 


-I 


log 


X 


P(H, ]x) 


P(hJx) 


dy 1 (x) - log 


P(H l ) 


p(h 2 ) 


( 13 ) 


Although I(l:2;x) and 1(1:2) can be expressed in terms of the 
prior and posterior probabilities, they are actually invariant with 
respect to changes in these probabilities since they only depend on 
the likelihoods. This is not true for the entropy or the semantic 
measures discussed by Hilpinen (1970). Commonly there are two dif- 
ferent interpretations given to P(IL) In communication problems 
(Jamison (1970, p. 29) j Box and Tiao (1973, pp .12-18) ) . One is in 
terms of frequencies of occurrences for IL among all the different 
hypotheses. These frequencies are usually much more difficult to 
determine than the likelihoods for most nonlinear experiments. A 
second interpretation is in terms of degree of belief (Box and Tiao 
(1973, p. 14)), in which case the entropy of a set of probabilities 
becomes a function of the observer f s belief system rather than of 
the experimental situation. While in some instances such a depen- 
dence may be desirable, our conception of the present problem requires 
that the information be strictly a property of the experiment and the 
data it produces. This is the Relativity Principle of Belief. 


Connection with Fisher Information Matrix 

Let Jf be a homogeneous set (or space) of hypotheses, i.e. H. , 
ILeff implies Then the measure 1(1:2) is called (Kullback 
(1959, p. 7)) a directed divergence in this space and performs much 
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the same function as a metric or distance. Suppose that H is also 
parametric with each member represented by a vector 0_ in some open 
convex set in a p-dimensional Euclidean space and that each f ^ (x) = 
f (x; 0) is of the same functional form. Under certain regularity 
conditions, I(8:8+A8) can be approximated by a linear operator L 
for A0_ sufficiently small (Kullback (1959, pp. 26-28) ) , giving 

1 (6 = 8+A8 ) ^ L (Q,6+A6) . (14) 

Each coordinate representation for H produces a matrix representation 

of L , and Kullback shows that this matrix is simply % times W, the 
op — 

Fisher information matrix. 1(1:2) under these conditions is there- 
fore similar to the metric of a curved space or the nertia of an 
extended object, and all the scalar measures introduced above 
(jw|, C, etc.) merely represent various aspects of this nonscalar 
measure. (Figure 7 shows the contours of I(8_:0) = 8_’F/F_ 9_, with a 2 = 1 
for the decay example.) 

It should be noted, however, that the information is an operator 
whose expected value 1(1:2) for two hypotheses is invariant with 
respect to coordinate transformations in the space used to specify 
the hypotheses. The Fisher information matrix W, on the other hand, 
changes for different coordinate representations. 


Information "Components" 

Another set of useful scalar measures are the quantities 


I 

r 



r = 1,.. . ,p 


(15) 
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which measure the components* of information in the data associated 
with the p parameters. Each one is a valid information measure which 
gives the amount of information for discriminating between two hypoth- 
eses whose associated probabilities are normal distributions with 

variances a 2 and means 0 and 0, respectively (Niple (1978)). Con- 
8 r r 

tour plots of these quantities for the two decay rates 0^ and 0^ are 
similar to Figs. 2 and 3 (when a 2 is independent of £ and *12> 
except the scales are multiplied by constants. The positions of the 
maximum information components about 9^ and 0^ are thus given in (10). 
Maximum information component corresponds to minimum percent uncertainty 
%0 r , where 



V ' 6 r 


x 100%. 


Discussion 

\ t 

Many approaches to the problem of nonlinear design suffer from 

i 

overly simplified assumptions about the mathematical complexity of the 
quantity used to represent the information in experimental data. The 
example of a two step decay process has illustrated this . No one 
optimum design suffices for this experiment because different experi- 
menters performing the same experiment may be interested in different 
aspects of the decay. For this reason, the discrimination information 
measure, which is nonscalar, is a more realistic choice. It incorporates 
many different aspects of the information, and individual experimenters 
can then select those aspects most relevant to their purposes when 


*The word "components" is perhaps misleading since it seems to imply* that 
the information is a vector. The word "pieces", introduced by dePristo 
and Robitz (1978) in a somewhat different context may be more appropriate. 
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designing their experiments. Thus the question: which design yields 

the most information? is ill-posed since the word "most" implies a 
scalar measure. A better question is: what design yields the best 

information? where now the word "best" allows for the multifaceted 
nature of the information in a set of data. 

The discrimination information provides a rigorous and general 
formalization of the nonlinear design problem which includes as 
specific cases the other approaches discussed above. 

The insights gained by this approach to design should allow a 
number of previously intractable problems to be analyzed. The design 
problem for isolated absorption lines has already been analyzed (Niple 
(1978)) and solutions to additional spectroscopy problems will be 
given elsewhere. 

The expected information for a particular design can be a function 
of the unknown parameters as well as the design matrix D. Because of 
this, estimation of the information depends also on estimation of 0_. 

A complete treatment of the resulting determination of an optimum 
design will involve prior probabilities and utilities in the usual way 
(Rosenkrantz (1970)). At an informal level, a knowledge of the coarse 
variation of the information components together with some of the other 
scalar measures should be sufficient for most applications, although 
situations in which the cost of the designs is a rapidly varying 
function of the design parameters may require a more involved treatment. 
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ABSTRACT 


A mathematical model is introduced for experiments 
which yield spectra of isolated, collision broadened 
absorption lines. A nonlinear design analysis is then 
performed on the model to identify the optimum experimental 
conditions. The information in the spectra is split into 
components associated with each of the unknown parameters 
to be estimated from the spectra. By exploring the variation 
of these components it is shown that spectra with line 
center transmit tances of 0.25 are nearly optimum for 
measuring the line position, the Lorentz width o(, the instument 
resolution H, and the product of line intensity and 
absorber amount. The trade off between signal to noise 
ratio (SNR) and resolution Is investigated for several 
different resolution dependent noise levels. The analysis 
shows that when SNR is proportional to the square root of 
the ratio of resolution and Lorentz width, a single optimum 
resolution and line center transmittance combination of 
H/o( “ i.5 and 0.28, respectively, exists. 



Introduction 


Qualitatively there are two kinds of data: good data and bad 
data. The goal of Experimental Design is to determine* ahead of time, 
which kind a particular experiment would be likely to yield. The 
influence of the signal-to-noise ratio (SNR) on the quality of 
experimental data has long been recognized, yet other effects may 
also be significant. In spectroscopic experiments the spectral 

resolution is equally as important as SNR, and the data quality is 
often determined by a trade-off between these two. For the purposes 
of experimental design, therefore, it is important to have a 
quantitative criterion which describes all the factors which influence 
the outcome. 

Such a multi-faceted criterion was discussed in a previous 
paper (referred to as NS-l). It is the discrimination information 
measure developed by Fisher ^ 2 \ Good^\ Kullback^ and others. 

In the present paper, the information components, introduced in NS-l, 
are used to analyze experiments in which spectra of an isolated, 
pressure-broadened, (Lorentzian) , absorption line obtained by instruments 
of finite resolution are collected. Such spectra have been analyzed 
previously e.g. Chang et al^\ Although qualitative descriptions 
of the type of spectral data most suitable for analysis were given, 
no quantitative investigation of the experimental design was made. 

It is hoped that the results presented below will lead to a 
better understanding of the power of the discrimination information 
measure and, more generally, to improved techniques of experimental 
design in Spectroscopy. 
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Pressure Broadened Lines 


Consider an experiment which measures the intensity of radiation 

transmitted by an absorbing gas sample at a uniform temperature and 

total pressure P T . When P T is sufficiently high, the line shape is 

(6) 

determined by collision broadening, y and 'is Lorentzian, The 
signals y. are recorded at N equally spaced frequencies V. from 
some lower frequency to an upper frequency 

■9 + (n-i)AV, (i) 

U I* 

where is the constant frequency spacing. If the noise in the 

signal has a normal distribution with rms value and is independent 
of frequency, the form of y in the vicinity of an isolated absorption 
line is described by 


€. 

l 


+ B(^ 




R(i> ,t\;H)exp -SuK^P^/fT’ 


(* Y 

o 



,( 2 ) 

dV 


i = 1, 


,N. 


In this equation B(SO represents the background signal with the 
absorber removed and is assumed to be of the form 

B("^ i ) = a + b( V ± - (3) 

with b<=?0 and the mean frequency. This gives a background nearly 

independent of frequency. R(S>, V^H) is the instrumental spectral 
response function and is assumed to be of the form 

«(. -1 ;H) = (1- h>'- ^I/HJ/H, K - V.| ^ H, 

= 0 , 

This form is triangular with full width H at half height. £. . 
represents the random noise . 
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Equation (2), which is called the model for the spectrum, assumes 
that the combined effects of foreign and self broadening can be 
represented by the single term o^P^. The line parameters S and^ Q 
are the line intensity and the pressure broadening coefficients res- 
pectively* V q is the line center frequency, and u , the absorber 
amount, is the product of radiation path length and absorber partial 
pressure. 

A typical spectrum calculated from the model is shown in Fig. 1. 

Also shown in this figure are two parameters W Qbs and t^ that are 

used to describe absorption features in general. The width of the line 

W at the half peak absorption points is a complex function of the 

instrumental response function width H and the line parameters. 

The transmittance at line center is defined as t . 

P 

* 

There are twelve parameters involved in the model: the three line 

parameters: V , S, andod^, and nine physical parameters : 9^, 

2 

N, a, b,'H,o- , u and P^. There are, two types of experiments which 

require the collection and analysis of spectra described by this model. 

In one , the line parameters and S are known and the physical 

parameters u and P^, are retrieved from the spectrum as well as 

2 

a, b, H, and a— . In the other, P^ and u are known and the 
line parameters are retrieved instead. In both cases N 

are also known, and the goal is to retrieve the remaining seven unknowns. 
One way to effect this retriev'al is by nonlinear Least Squares Regression 
Analysis, which is summarized in the next section. 
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The Least Squares Variance-Covariance Matrix and Information Components 
When a set of observations is described by the model 

y i “ € i + ^^l***** S ( k )? V i = (5) 

where f is some function of k known variables Si***-! l k and 
p unknown parameters and is a normally distributed 

-L p 1 

2 

random variable with variance cr' and mean zero, the asymptotic 

■ft ft 

variance-covariance matrix of the Least Squares estimates 0 .,.., g 

^ P 
(7) 

for the parameters is given by 


=£ 2 (j* j r 1 


(6) 


A 2 

In this, o-' is the estimated noise variance and J is the Jacobian 
matrix of f with 


(l) Uj 

hj = ; 8 , ,6 f ) 

S.,-9 




(7) 


The diagonal element of S is the asymptotic variance 


A A 


i~\; of and S. ./c w . 7 - is the asymptotic correlation coefficient 
J J 13 J X 

A A * 

between 0. and 0.. The asymptotic percent uncertainty % B. is 
0 x 3 

therefore given by 


% b . = 100 ( £;/ e.) . 

j j j 


( 8 ) 


The values of 0 0 are those which minimize the sum of 

square deviations 

<«e>, jr [y.-ftrf 


In NS-1 the quantity 


I i ~ ( §./£\) 2 


(9) 


k 



was recommended as a measure of the component or ’’piece" of information 
in a set of data about the unknown parameter. From Eqs.(8) and (9) 

„ a 

it is seen that 1^ is large when the per cent uncertainty % is small 
and vice versa. This corresponds to the intuitive sense thaij if a set 
of data leads to a parameter estimate with a large uncertainty for 
the jk parameter, as reflected in a large $0^, then that data set 
does not contain very much information about 0.. (See NS-1 and its 
references for a more formal justification of the term information 11 
above ) . . 


The Information Components for Equation (2) 

From Eqs. (6), (?) and (10) it is seen that each 1^ depends 

upon the £5*.[ an ^ as well as the £ 0^ . For the model in Eq. (2) 

this means that twelve different variables can influence the information 

content of a spectrum. To simplify the analysis, the value of b was 

fixed at zero in all spectra analyzed. This is not the same as 

assuming that b is known before analyzing the spectrum. It can be 
/ o \ 

shown' ' that the remaining eleven variables contain several redundan- 
cies (related to symmetry properties of the information) so that each 
of the information components for S, oC Q , a, H (or equivalently 
u, a, H) can be written as 


I. (SNR; N F.(E,C,H/«,H/co), (10) 

J 0 

where F. is some complex function of the four variables E,C,jy»c and H/x , . 
3 


These are defined as: 


E £ cv \)/« obs 
SNR = a/V, 
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C S <*0 



and oC — c 6 P t . 

The extent of the spectrum £ is measured in observed widths. 

C specifies the centering of the line in the observed frequency- 

range, and a value of 0.5 corresponds to an exactly centered line. 

The approximate nature of Eq. (10) is due to the N dependence. (The 

validity of this approximation is discussed below.) 

In order to present the variation of the information components 

in a meaningful and easily understood manner, the values of E and 

C were fixed at 10 observed widths and 0.5006 respectively, and a 

range of values for Q/x* and H/6(, was selected. Representative 

values for SNR and N of 100 and 300 respectively were then used 

to generate three dimensional surfaces for each I.» These can be 

3 

shown as either two dimensional projections or contour plots. 

The ranges of values for -0-/*. and H/x. are shown in Fig. 2. 

Also shown are the contours of t over these ranges and several 
examples of spectra generated from Eq. (2). These spectra were 
calculated for fixed values of N, E and C and five pairs of values 
for 0-/cc and H/eo . For the purpose of reference, spectra corres- 
ponding to points in the lower part of Fig. 2 correspond to weak 
lines (as indicated by a line center transmittance > 0.85) and those 
at the top to strong lines. Similarly, spectra corresponding to 
points at the left correspond to high resolution and those on the 
right to low resolution. Figure 3 shows the contours of the observed 
width W Qb ^ over the same range of values with 06 « 0.07084cm -1 . 
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Before investigating the variation of the I. with E and N 

0 

fixed, the effects of changing each of these, one at a time, on the 
information content associated with spectrum C in Fig, 2 were 
investigated. Figure 4 shows how each of the I . for H, OC , 

-Q. = /SiaT and Su varies as a function of E. The calculations were 
done for 10 points / observed width, and the quantity 

- (V W' ; V 2 

was used for the line center component. Each I. was normalized to 

, «] 

one at E = 1G . Notice that, except for 1^ , each of the I. 

® J 

continues to increase with increasing extent even though 'there is no 
significant absorption beyond about five observed widths from line 
center. We attribute this increase to the correlations between the 
background parameters a and b and the other parameters. Bata points 
far from line center make the greatest contribution to the precision 
with which the background can be retrieved and thus they also in- 
directly contribute to the precision of parameters correlated with 
the background. 

Figure 5 explores the N— dependence in Eq. (10). The value 
of each I /N (normalized to unity at N = 600) is shown as a function 

c) 

of N, again for spectrum C in Fig. 2, with E = 10 . Provided N 
is larger than 7o ♦ the approximation is. quite good. This result 
also allows the definition of an average information rate which only 
depends on E, C, H/ec , -O- /<*. for instruments where (SNR) • N is 
proportional to the time required to collect the data. Often this is 
approximately true for both grating and interferometer type spectrometers. 
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Figure 6 shows the contours of I g^ over the range of spectra 

in Fig. 2. (Recall that SNR = 100). This will be the component 
associated with S when u is known, and the component associated 
with u when S is known. These contours are labelled with the 
value of the percent uncertainty ^Su rather than with the value of 
Isu’ s ^ nce f° rrae:r is more familiar. The distance between adjacent 
contour lines corresponds to a Idb change in % Su and hence to a 
2db change in ig u * This figure shows a ridge of high information 
content about Su for spectra with isu/ff oo 1.3 and H/^. ^ 2. 

From Fig. 2 it is seen that this ridge lies approximately along the 
tp = 0.25 contour. Consequently the best spectra for measuring Su 

have a resolution on the order of, or less than, <*, and a line 
center transmittance of ^ 0,25. Figure 6 also shows that little 
additional information is gained by improving the resolution beyond 
H /« ^1.5. This assumes that SNR is independent of resolution. 

More is said about the case where they are not independent in the 
discussion below. 

Figure 7 shows the contours of 1^ , again labelled with %o(,. 
These contours are similar to those of Ig except that, in each case, 

I ^ is strictly less than Ig^. We attribute this to the different 
types of information about the spectrum described by the parameters 
S and 06 . Although the effects of these parameters on the spectrum 
are by no means independent, nevertheless, in a crude sense, S = 
is estimated from the area of the line whereas cC is a shape factor. 
The results in Figs. 5 and 6 show that, in a given spectrum, there is 
more information about the equivalent width of the line than the 



V 
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shape. Also, the effect of improving resolution is more marked for 
oC than Su, as would be expected for any line shape parameter. This 
is seen in the steeper upward slope along the optimum ridge as H/cc 
decreases. 

Figure 8 shows the variation of the information component $H 
associated with the resolution parameter H. This figure is comple- 
mentary to Fig. 7. This is as expected since a line whose shape is 
determined mainly by the instrument spectral response function 
(H/oc large) contains little information about PC . Similarly a line 
whose shape is determined by collision broadening effects (H /o(, small) 
contains little information about H. 

Figure 9 shows the variation of The similarity between 

this figure and Fig. 2 indicates that the dominant factor in deter- 
mining the information about 0. is the line center transmittance t^» 
This figure was obtained from the values of 0“^ and in the 

previous figures along with the correlation coefficient between Su 
and oC according to the standard propagation-of -errors formulae. 

Figure 10 shows the contours of I (These are labelled 

with w obs /^v 0 i* 1 percent.) As with , the t^ influence is the 

2 

most important. In many cases, the quantity of interest is c "'s> 0 » 
the absolute uncertainty in the estimated line center. By comparing 
Fig. 10 with Fig. 3 of W it is seen that the lowest absolute 

line center uncertainty occurs for spectra with high resolution 
(H/o6 small) and intermediate strength ( v^Su/nvc ci 1.5) • Consequently 
the lines on the ridge of Fig. 6 for I g are nearly optimum for 

T Su’ T oc * and ^o* 
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The Case Where SNR Depends on Resolution 


As stated above, the percent uncertainty values used to label the 

contours in Figs. 6 through 10 are based on a uniform SNR of 100 

and N =; 300 points, Eq. (10) can be used to compute the values of . 

the I . for other values of these quantities. 

3 

It is interesting to explore the situation where SNR is a 

function of the resolution H. Many different choices for the functional 

dependence are possible, depending upon the operating characteristics 

of the spectrometer used to collect the spectrum. For grating spectro- 

(q) 

meters it is often found that , away from the diffraction limit, 

SNR oC W l , 

where ^ W g is the physical slit width. The relationship between W £ 

and H is complex and depends on the alignment of the spectrometer, 

but, for narrow slit widths, H is nearly independent of and, for 

large slit widths, H becomes linearly dependent on W^. For large slit 

widths, therefore, the SNR can be expected to be proportional to 

and, as H decreases toward some limit H . , SNR goes to zero. 

mm 

The noise dependence for interferometers Is discussed by Flemming. 

It is a simple matter to substitute an appropriate function of 

H into Eq. (10) for SNR and calculate values for any of the information 

components, given values for H, N, E, C, H/oc, , and-Q./*, , where 

the values of the F. are taken directly from Figs. 6 through 10. 

«J 

The resulting functions of six variables could be optimized on a 
computer, or by trial and error, to obtain the best spectral parameters 
for measuring specific unknown parameters i.e. Su. o6,u., etc. However, 
because of the many variables involved, the results of such an 


10 




optimization are very difficult to present graphically. We have there- 
fore chosen to assume that the interdependence of SNR and H (and N, 
where appropriate) can be expressed in the form 


SNR 



( 11 ) 


where d is some positive constant. The resulting information 

components can now be presented in contour plots by fixing E and C 

and including the N- dependence as a scale factor, as was done 

before. Figure 11 shows the effects of a resolution dependent SNR on 

the contours of I in Fig. 7, for the case where d=l/2. The 

optimum (labelled +) occurs at H/<*, & 1.5, Jl/tx, ^1.3i with the 

information falling off at 2db per contour line going away from this 

optimum. The corresponding locations of the optima for d = 1 and 

d = 3/2 are shown by the 0 and X, respectively. For d = 2 the 

optimum lies to the right of the figure. The track of the optimum, 

as d increases, is indicated by the dotted line, and is seen to 

follow the ridge of Fig. 7* Similar results were obtained for 

Such figures are similar to the well known resolution - SNR trade- 

( 11 ) 

off curves of Backus and Gilbert , except the importance offt/<tf 
(or equivalently the line center transmittance) has now been included 
as well. 


Discussion 

One of the conclusions to be drawn from the contour plots 
in Figs. 6 through 8 is that neither spectra of very weak lines nor 
spectra of very strong lines contain very much information about Su, 
dC or H. This is seen in the high percent uncertainties at the tops 
and bottoms of these figures. Spectra of very weak lines, especially 
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low resolution spectra, do not contain much information about 12. or 

S> o either, but high resolution spectra of strong lines do contain 

considerable information about these two parameters* All the figures 

indicate increased information with improving resolution (except Xjj ) , 

however, improving the resolution beyond E/oC -a/l leads to little 

additional information, provided _ the signal to noise ratio, SNR, 

. « 

is independent of the resolution. When the SNR varies as some power 
of the relative resolution H/°e , a unique optimum spectrum exists for 
measuring Su or <x- which has a larger H/oo value for higher powers 
of E/oC . 

Figure 4 shows the importance of including data points in the far 
wings of the line in order to measure Su, oC , H or XL , but not V . 

A similar analysis of the centering dependence (see Ref. (8)) showed 
that the line should be centered in the middle of the observed frequency 
range . 


Also apparent in the figures is the importance of the line center 

transmittance value, t for determining the information content* 

For Su, 06 , H and (assuming the absolute value of the variance 

is used) a t value of 25$ transmittance at highest resolution 
P 

yields approximately the best information for all these parameters. 

We feel that the results given above are best viewed within the 
Information Theory framework discussed in NS-1. In this framework, 
Figs. 6 through 11 are interpreted as displaying various aspects of a 
multidimensional mathematical operator which formally represents the 
information in a set of data; in the present case, a spectrum. The 
overall implications of the figures agree with one's intuitive feel 
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for the problem, as , indeed, they should. For example, the conclusion 
that spectra of very weak lines contain little information about any 
of the line parameters (as indicated by small I , I and Io ) is 
well known to anyone who has ever tried to analyze such spectra. Yet 
the value of a quantitative measure that expresses what one can expect 
to learn from a set of data should not be underestimated, especially 
for the case where cost analyses are being performed on several 
competing experimental proposals. A reliable quantitative measure 
greatly simplifies the difficult decision of choosing between repeated 
performances of inexpensive but less informative experiments, and a 
single trial of a more expensive but very informative one. Hopefully 
the techniques presented in the present work will prove useful in 
such instances. 
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Figure Captions 


Fig. 

Fig. 


Fig. 


Fig. 


Fig. 


Fig. 


Fig. 


Fig. 


Fig. 


Fig. 


Fig-. 


1 - A typical absorption line of type described by Eq, (2). 

2 - The range of values for.fi. /oC and H /oC used, including 

contours of line center transmittance t , and five representative 
spectra corresponding to points in the r£nge. 

3 - Contours of the observed width W , (in cm -1 assuming 

0.0708 cm" 1 ). obS 

4 - The dependence of the Information components on the extent 

of the spectrum E, for the components associated with -y> o 
(line center), H (resolution), o(, (Lorentz width), 

Su and _Q_ . 

5 - The dependence of the information components on the number 

of data points N, for the components associated with 
~i> (line center), H (resolution), oC (Lorentz width), 

Su°and ,J2. . 

6 - Contours of Information component associated with Su 

(labelled with #Su) for N = 300, SNR = 100, E = 10 
and C = 0.5006. 

7 - Contours of Information component associated with 06 

(labelled with %oC) for N = 300, SNR = 100, E = 10 
and C = 0.5006. 

8 - Contours of information component associated with H 

(labelled with for N = 300, SNR = 100, E = 10 

and C = 0.5006. 

9 - Contours of information component associated with SI. 

(labelled with &Xi.) for N = 300, SNR = 100, E = 10 
and C = 0.5006. 

10 - Contours of information component associated with "9 

(labelled with <r~^> / W , x 100#) for N = 300, SNR = 100, 

E = 10 and C = 0.58o6.° bs 

11 - Contours of information component assoc iated with eC for N = 300, 

E = 10 and C = 0.5006 when SNR £2 iE/cc . 

+ indicates maximum. 0 indicates maximum when SNR ^ H/eO . 

X indicates maximum when SNR — (H / cC V' (spacing corresponds 
to 2db change). 
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ABSTRACT 


A method of estimating the values of the parameters in the 
models describing the positions, widths, and intensities of the lines 
in rotation-vibration bands of gases without the need for line by 
line analysis is described. To illustrate the technique, portions 
of the 1-0 bands of HC1 and CO have been analyzed. The values of up 
to 27 parameters, their standard deviations, and the correlations 
between the parameters required to describe the spectra have been 
obtained. 
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Introduction 


Experimental data (signals) are often analyzed by assuming a 

model and determining the values of its adjustable parameters. The 

models may be based on theoretical considerations or empirically 

derived. In either case, an often-used criterion for estimating the 

* 

best parameter values is that the sum of the squares of the residuals - 
the squares of the differences between the signals predicted by the 
model and the experimental signals - should be a minimum. Indeed, 
as Albritton, Schmeltekopf and Zare^ 1 '* have noted, if the appropriate 
model is linear in the unknown parameters "the method of least squares 
offers valuies that are the most precise (i.e. minimum variance ) 
unbiased estimates that are linear functions of the measurements," 
Corresponding assumptions are often made concerning non-linear models. 
In all cases, the objectives of the analysis include a desire to obtain 
the parameter values as precisely as possible, to minimize observer 
bias, and. to show the model used is an appropriate one. 

In some methods of analysis an intermediate set of parameters 
is first obtained from the experimental signals and, from this set, 
other characteristics of the system studied are inferred. Thus, for 
example, the individual lines in rotation-vibration spectra of gases 
can be described by a position, shape, and intensity. The values 
of these line parameters, e.g. the line positions, may be described 
by other relations containing fewer parameters. These band parameters 
are usually obtained by analysis of the line position values. If 
these line positions are obtained by the usual line-by-line analysis 
of the spectrum it must then be assumed that there are no correlations 
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( 1 ) 

between these line position values. Again quoting from Albritton et al 
"The correlations of spectroscopic measurements themselves (e.g. 'raw' 

line positions) apparently have always been implicitly assumed to be 

zero and indeed it would appear to be difficult to do otherwise (but, 

I 

admittedly, this particular area is unexplored) . " In addition, if 
the line positions are estimated by the observer, the errors associated 
with these estimates are based on the observer's judgment. Although 
these errors typically depend on the appearance of each line in the 
spectrum it is not usually feasible to weight the line position 
estimates individually. Indeed either the same weight ^ is given 
to all the line positions obtained or the weight is related in some 
ad hoc manner to, for example, the peak absorptance of the lines. 

In many spectra some fraction of the lines are blended and the infor- 
mation from these lines is xncluded or discarded base'd on the judgment 
of the observer. 

Thus the final estimates of the band parameters and their 
precision are usually based on a series of steps, each step requiring 
the judgment of the observer, and the end results depend on the skill 
of the observer. - 

In the present method these intermediate steps are omitted , 
and the desired band parameters are obtained directly from the experi- 
mental spectral data. Equal weights are assigned to all the signals 
in the observed spectrum and the band constants are retrieved directly 
by a non-linear, least-squares, spectral-curve-fitting method. This 
is the equivalent of weighting each line in a consistent and statistically 
valid way and the band constants are unaffected by observer bias. 
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Since this method eliminates the intermediate step of measuring 
individual lines and analyses the complete data set # more of the 
information present in the spectrum can be extracted than in conven- 
tional approaches. 

'In this paper, it is implicitly assumed that the noise in the 
spectrum is independent of position and signal strength, that it is 
randomly distributed with mean zero, and that the model is sufficiently 
accurate to represent the data. It is useful if the data are available 
in digital form and it is important that the experimental conditions 
do not change significantly during data collection. Spectra obtained 
by Fourier Transform Spectrometers (FTS) are particularly suited for 
this method of analysis since information about the entire spectral 
region is obtained simultaneously. Indeed the ability of FTS to 
acquire data rapidly requires a similar speed in the analysis if the 
full potential of this type of instrumentation is to be realized. 

In Fourier Transform Spectroscopy the experimental signals 
consist of interferograms. Our mterferograms are transformed into 
conventional spectra by programs supplied by the FTS manufacturer. 

It is not our purpose to evaluate the relative merits of different 
transformation techniques and it has been assumed that the transformed 
spectra are the experimental data sets to be analyzed. As in many 
statistical analyses it is assumed that there is no error in the 
independent variable (here the frequencies at which signal information 
is obtained). These positions are based on the wavelength of a 
helium-neon laser associated with the interferometer and they also 
show a small dependence on the optical alignment of the system. We 
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have detected small shifts in the positions of spectral features in 

similar spectra taken on different occasions. These may introduce 

small systematic errors in the values of the estimated parameters 

reported here, since we are unable to model them at this time. 

The usefulness of this technique has been explored by analyzing 

spectra of the 1-0 bands of HC1 and CO. These were chosen because of 

the simplicity of the band models and because the molecular constants 

have been previously determined with high precision. The analyses 

were performed with the non-linear, derivative-free, computer-program 

( 2 ) 

BMDPAR described by Brown et al by using an Amdahl computer system. 
This and similar programs have been successfully used to analyze other 
types of spectra^). 

The method requires that the line intensities, shapes, widths, 

and spectral positions be appropriately modelled and that the effects 

of finite spectral resolution on the appearance of the spectra be 

(4) 

included. Previous experience has shown that good values for the 
line intensities and widths are obtained provided there are a sufficient 
number of data points in the vicinity of each line and the line widths 
are comparable to the spectral resolution. It has also been found 
that the line positions are not strongly correlated with the other 
parameters. Since most gases have pressure-broadened half-widths of 
from 0.03 to 0.1 cm /atm at room temperature, these previous results 
suggest that, if the spectral resolution is about 0.1 cm" 1 , then the 
sample pressure should be near one atmosphere if the line widths are 
to be determined. The line widths of HC1 are strongly J dependent^'* 
and become very narrow for high J values. There is also a pressure 
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( 6 ) 

shift of the line positions . Thus, in addition to reanalyzing 
previously studied^ ^ spectra of HC1, a spectrum of the 1-0 bands of 

(7) 

CO has been investigated. The J dependence of the line widths 

/ON 

and pressure shifts' 1 are considerably smaller for CO than for HC1. 


Theory 

The theoretical bases for the models describing the rotation- 
vibration bands of gases are described in an extensive body of liter- 
ature. As the experimental spectral resolution increases the number 
of adjustable parameters required by the models increases. Only 
those parameters which have significant influences on the observed 
spectra should be included in the analysis. 

The spectra analyzed were obtained with a Digilab FTS having 


a nominal resolution of 0.1 cm . Analyses 


(5 - 9) of 


these and similar 


spectra have shown that the individual line positions can be estimated 
—1 

to about 0.002 cm , and the intensities and widths to a few percent. 

Spectra of the 1-0 bands of HC1 and CO analyzed here show two sets 

35 37 

of lines. These belong to the isotopes of H Cl and H Cl and to 
12 C 1 ^0 and 1 ^C 1 ^0 respectively. Lines of other isotopes and of hot 
bands have been ignored in this analysis. 

Provided the pressure of the gas sample is not too large, the 
positions of the lines are given by 

i it . . 

V. = E. - E. , (1) 

i l l * 

where i = 1 identifies a model parameter of ^C^ 0 or H^Cl , 

and i = 2 identifies the corresponding parameter of or H^Cl. 
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For these molecules 

E i = ~*bi + + ^J^J+l) 2 +H^J 3 (J+1) 3 t 

and = B^J(J+l) + D^J 2 (J+1) 2 +H^J 3 (J+l) 3 , (2) 

where the symbols have their usual meanings and allowed transitions 
occur for Aj = il. The values (10,11) Q f the IL terms are small 
and are not expected to be retrieved with any precision from the 
spectra analyzed here. 

1 «1 

The individual line intensities S . (m) (cm /molecule) can 

ox 

be represented *' 12 ^ by 

sj.(m) = (8Tf 3 /3hc)\> (m) Ie 3 (m)| 2 ( iml/Q. ) exp £-E. (m)/kTj , (3) 

1 Ol X “1 J 

where |r^(ih)J 2 is the square of the dipole, moment matrix element, 


Q.~kTAcB^, (4) 

is the rotational partition function, m = -J in the P branch 
(AJ = -l), and m = J + 1 in the R branch (AJ = l). 

(13) 

The square of the dipole moment matrix element can be expressed ^ 
as the product of a rotationless factor and a rotational or Herman- 

Wallis factor, 


|Rj.(m )| 2 = |r^.(0)| 2 (1 + +/? 2 m 2 ) , (5) 

where and^g 2 are assumed the same for both isotopes. Their 
values are small -for both HCl and CO. Eq. ( 3 ) may be written as 

sJi(m) = W_^(l +^-^10 ^(m) |m( exp j~E^(m)/kTj , (6) 


where 


W. = Stt |R o \( 0 )l 2 /3hcQ i , 
is constant for all the lines of a given isotope. 


(7) 
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The pressures of the samples were such that the lines have an 
essentially Lorentzian shape with absorption coefficients given by 

k m i^) = [s>) ( fm I VfrJ 2 +oc 2 d®J)j 1 * (8) 

where ct(|znj) are the line widths. Lines of both isotopes with the 

» t i 

same lm( value have been assigned the same widths, and deviations 
from the Lorentz shape in the far wings have been ignored. The 
dependence of oc(|m() on ( m| has been modelled by 


2 2 

ctgdml) = cC Q +0^(311 + o6 2 m + e6j|mp + 0^01 


(9) 


where the coefficients oC ■ are to be determined and oC (|m|) is the 

3 s 

half width at some standard pressure (?60 Torr). The corresponding 
widths at pressure P are 


oO(lmj) = Po£ (1m|)/P . 

s s 


(10) 


Initial values for the coefficients ot. were found by fitting Eqs. 

3 

(9) and (10) to previously reported values for the line widths 

Eqs. (l) - (10) can be used to give the combined absorption 
coefficient due to all the lines 


k b ( ^ " S^bi^' 

The corresponding monochromatic band transmittance 

T bt W) “ “P [-S,2 k bi Ni ”i] ' 


( 11 ) 


( 12 ) 


where are the numbers of absorbing molecules of each isotope in 
the path. If the absorbing path length is 1, the absorber partial 
pressure is P , and the sample temperature is T 

£L 


N. = f 1 P T L/P T , 
1 i a o s ’ 


(13) 
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where f\ is the fractional abundance (assumed known) of an isotope, 

T = 273K, and L is Loschmidt’s number. It is then found that each 
o 

of the terms on the right hand side of Eq. (12) contains a term of 
the form 

V. = W.N./Vf, (14) 

i z l ’ 

which consists of the unknown parameter 1 r\( 0)| 2 and other, known 
parameters. In the analyses, the estimated values of were obtained* 
The monochromatic transmittance can also be written as 

T, (V) = I.(S> )/ I (V ) , (15) 

bt t o 

or I. ( V ) = I(V)T,.(V), ( 16 ) 

t o bt 

where I Q (V ) is the signal observed in the absence of the sample 
and I t (V ) is the signal with the sample present. The corresponding 

t 

signal observed with a spectrometer of finite resolution 

I(V ) = I (V )T(*V ) 
o 

= yi o (V)T bt (V) ff"(V, • )d V ’//^(V, v')dv! (17) 

where <t'('» > , V) describes the shape of the spectral response function 

(5) 

of the instrument. Previous analyses of our FTS spectra have 
shown that ("P, V 1 ) can be well represented by a triangular function 
with full width at half height E, and that the signal, in the absence 
of the sample, 

I(V) = a + b(V -*V) +c(V-7) 2 , (18) 

o 

where the values of the parameters a, b, and c are to be estimated 

and *V is chosen to lie near the center of the spectral region 
analyzed. Typically, the background I (V ) of the sample spectrum 
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shows a marked dependence on frequency because of variations in the 
spectral radiance of the source and of the spectral detectivity of the 
detector. Most of these variations can be removed by ratioing the 
signals in the sample spectrum with the signals of a spectrum obtained 
with the sample removed. In such cases the background I q (V) is 
nearly independent of frequency and the last two terms in Eq. (12) 
are essentially zero. 

The above expressions enable the experimental, spectra to be 
modelled by a function containing 27 adjustable parameters. Eight of 
these (two background parameters: b and c, two Herman -Wallis factors: 

/3 and/S^* an d four H terms) are expected to have only a small 
influence on the spectra. Estimates of each of the parameter values 
are obtained by analyzing the entire set of signal values in the spectral 
region of interest. This includes signal values in the vicinity of 
blended lines as well as those where there is little line absorption. 
Estimates of the asymptotic • standard deviation of each parameter 
and the asymptotic correlation matrix are also produced. The reliability 
of the retrieved values depends both on the quality of the spectra 
analyzed and on the accuracy of the models used. The model accuracy 
can be estimated by comparing the spectrum calculated from the 
retrieved parameter values with the observed spectrum. The differences 
should be of the same order as the noise level in the experimental 
spectrum. 
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Results 


The method was tested by analyzing a synthetic spectrum of HC1 
calculated from the parameter values given in .Tables 1 and 2, under 
the heading Spectrum 1, These values were chosen to produce a 
spectrum similar in appearance to an actual HC1 spectrum analyzed 

(c) 

previously J by the conventional line by line approach. The sample 

conditions and half width parameter values were obtained from this 

previous work, the line position parameters from the data of Rao^ ^ ’ 

( 12 ) 

and the parameters defined by Eq. (5) from Toth et al . A signal 
to noise ratio of 100 was simulated by adding random numbers with a 
Gaussian distribution to the signal values. To reduce computation 
time only the spectral region containing the P(8) to R(9) lines was 
analyzed and all signal values more than 5cm ^ from a line center 
were removed. 

The analysis can be performed by constraining any number of the 
variable parameters to fixed values, or by fixing the ratios or 
differences of pairs of values. The data may be weighted, and the 
number of iterations and the tolerance for convergence specified. 

In these results, all the data were equally weighted, and a stringent 
convergence condition imposed. A large number of iterations may? be 
required before this latter condition is satisfied and many of the 
results described here were obtained after the specified number of 
iterations but before the convergence criterion was satisfied. 

The results obtained by analyzing the synthetic spectrum with 
no constraints are shown in Table 2 under the heading Spectrum 1. 
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These results were obtained after 16 iterations. Some representative 
parameter values obtained for intermediate iterations are shown in 
Table 3, together with the sum of the squares of the residuals. This 
sum was still decreasing after 16 iterations, and the convergence 
criterion had not been satisfied. The results in Table 3 show oscill- 
ations about the expected values and a slow convergence to the final 
results as the number of iterations increases. The estimated parameter 
values in Table 2 also include the asymptotic standard deviations as 
5.2736(9) H 5»2736 t 0.0009. In most cases the final estimated 
parameter values agree with the values used to calculate the spectrum 
within one or two standard deviations. The estimated values for the 
band centers were only printed to three decimal places. 

The root mean square error of the measurements can be estimated 
from the relation 

ff' = j^(residual) 2 / (N-nOj 1 ^ 2 , (19) 

where N (36U) is the number of signal values analyzed and m(27) is 
the number of variable parameters. The estimated signal- to -noise 
ratio obtained by using the value of ^(residual) 2 obtained after 
16 iterations 

<Sm) eat ■ V * )/V 16 ’ • 

- 100 . 20 . ( 20 ) 

This is close to the value of 100 used to compute the spectrum 
and indicates that a near optimum solution has been obtained. Additional 
iterations are expected to improve the estimates only slightly. This 
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increase in precision must be balanced against computer costs* A 
new spectrum must be calculated for each iteration and for the increment 
halvings required in any iteration. Most of the results described here 
were obtained by using inefficient methods of calculating the spectra 
and each analysis required about one hour of computer time. New 
programs now allow the same computations to be made four times more 
rapidly. 

The correlation coefficients between the variable parameters 
for this analysis of a synthetic spectrum are shown in Table 4. In 
addition to the expected high correlations between parameters such 
as V o , B , D, and H, other strong correlations also occur. In many 
types of spectral analysis only a subset of the available information 
is desired. If implicit assumptions are made concerning the influence 
of the unmeasured parameters on the spectrum, then the data in Table 4 
show that most empirical weighting schemes become subjective in 
their application. 

A spectrum of a sample of HC1 at ?0OTorr was then analyzed. 

This spectrum, shown in Fig. 1 of reference (5), has a signal -to -noise 
ratio of about 50. Information about the sample and spectrum is given 
in Column 2 of Table 1. The parameter values obtained after 12 iterations 
are shown under the heading Spectrum 2 in Table 2. Because of 
degeneracies encountered during the iterations, correlation coefficients 
and standard deviations could not be estimated. These degeneracies 
may have been caused by the poor quality of the data, the small number 
of lines analyzed, or the pressure shifts of the lines which were not 
modelled. They could have been avoided by fixing more of the parameter 
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values. The results are included because they show that most of the 
parameter estimates are reasonable. In particular the important line 
position parameters, half width parameters and band strength parameter 
values are in good agreement with those anticipated. 

t 

The results obtained by analyzing the spectrum of a sample of 
HC1 broadened to 50 Torr with are shown under the heading Spectrum 3 
in Table 2. This spectrum is shown in Fig. 1 of reference (5) and the 

sample conditions are given in Column 3 of Table 1. In this retrieval, 

* »t ’ -IP 

the differences were fixed to be less than 10 . The half 

widths of the lines in this spectrum are much smaller than the spectral 
resolution and could not be retrieved accurately, indeed, the estimate of 
0^2 reached the lower bound, and its standard deviation was not 
obtained. The large uncertainties in the half width parameters are 
reflected in a corresponding loss of precision of the spectral resolution 
and band strength parameters, as expected from Table 4. However, the 
parameters which determine the line positions are not so strongly 
correlated with the half width parameters and many have been retrieved 

with the highest precision of all the analyses in Table 2. The 
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intensities of corresponding lines of H Cl and H Cl are approximately 

the same and hence the precision with which the corresponding band 

parameters are estimated is similar. The estimates of the other 

parameters could have been improved by using the values of the half 

width parameters or the band intensity parameters obtained from the 

spectrum of the sample at 700 Torr in the analysis. We note that the 

estimated values of ^2^ obtained from both the experimental spectra are 

( 5 ) 

negative. A similar result has previously been obtained although 
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the precision of all the measurements is low. 

The experimental spectrum of CO shown in Fig. 1 has also been 
analyzed. The sample conditions are given in Table 1, Column 4. 

Initial values for the line position parameters were obtained from 
Rao^^ and Chen et al^^ , and the values for the constants in 
Eq. ( 5 ) from Toth et al . Estimates of only the first four half 

(lb 

width parameters DC- were found by fitting the data of Benedict et al 

since the m dependence of the widths is much less than that of HC1. 

A synthetic spectrum calculated with the parameter values shown in 

the last column of Table 5 showed good agreement with the observed 

spectrum after a uniform shift of 0.0384 cm ^ was applied to the 

observed data. This shift is included in the results shown in Table 5» 

all of which are based on the analysis of a single spectrum. 

The first set of results was obtained by fixing nine of the 

26 parameter values and making twelve iterations. The spectral 

resolution .was fixed to the value obtained from previous analyses of 
(5) (9) 

an HC1 spectrum although Chang et al have found that a value 

of 0.075 cm ■*" was more representative of the spectral resolution of 

a CO spectrum taken on the same day as that shown in Fig. 1. These 

authors estimated the spectrum analyzed was shifted by 0.0537 cm ^ 

with respect to the line positions in the Air Force Geophysical 

(15) 

Laboratories* listing of the parameters of atmospheric lines . 

Two of the background parameters, the Herman-Wallis parameters, and 
the H terms were also fixed because they are expected to have little 
influence on the spectrum and are unlikely to be retrieved successfully. 
In the next retrieval only three parameters were fixed and one 
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iteration performed to obtain estimates of the standard 'deviations of 

the other parameters. This Cftnfirmed that the parameters previously 

fixed have estimates with very low precision. Eight parameter values 

were fixed in the last two retrievals. The third set of results, 

obtained after four iterations, gave a considerable improvement in the 

X2 l6 

precision of many of the line position parameters of CO although 
the remaining parameter values were little changed. The last set of 
values was obtained after twelve iterations. No standard deviations 
were estimated, although the residual sum of squares was about 1 % 
lower than the previous two analyses. The results of four analyses 
are shown to indicate the stability of the solutions for differing 
conditions of analysis. 

Only a few comments on these analyses of CO are given.. . The 
fixing of the spectral resolution '(retrieval l) to an incorrect -value 
substantially increased the residual sum of squares compared to the 
subsequent retrievals. This incorrect value affected the half width 
parameter and band intensity parameter values, a result previously 
observed in the analysis of HC1 spectra. Some of the line position 
parameters were also affected by the poor choice of the spectral 
resolution. This may be caused by the blending of some lines in the 
experimental spectrum. 

The marked difference in the precisio.n of the line position 

parameters of the two CO isotopes is probably associated with large 

13 l6 

differences between the intensities of corresponding lines of "c 0 
12 l6 

and CO. However, when the relatively small numbers of lines of 
both HC1 and CO contained in the spectral regions analyzed is considered, 
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the precision with which the line position parameters have been 
estimated is satisfactorily high. The power of this method of entire 
band analysis is also indicated by the observation that previous 
attempts to use the spectral curve fitting method to analyze single 
lines of CO were unsuccessful unless several parameter values were 
fixed. 

The spectrum calculated from the parameter estimates of retrieval 4 , 
Table 5 is shown in Fig. 1 together with the differences between the 
observed and calculated spectra. The agreement is excellent. The 
dependence of the half widths on lm | is shown as the continuous 
curve in Fig. 2. The open circles are the experimental values of 

( 14 ) 

Benedict et al and the other values were obtained from the AFGL 

line listing^ 

Discussion 

A new method of obtaining unbiased estimates, standard deviations, 
and correlations between the values of the adjustable parameters in 
models of vibration-rotation bands, from experimental spectra, has been 
tested'. In addition to allowing the relative influence of each 
parameter on the spectrum to be assessed, the method can be used to 
test the merits of different models and to allow local anomalies 
caused, for example, by accidental resonances between close-lying states, 
to be detected. 

Although considerable effort is required to write the programs, 
a single spectrum can be analyzed by this method in far less time than 
is required by conventional methods. Thus, in the line by line 
method of analysis, the intensity, width, and position of each line is 



first obtained. These must be suitably weighted and combined to 
retrieve the appropriate band parameters. It is often necessary to 
analyze several spectra to obtain the desired line information. The 
present technique removes the necessity for measuring single lines and 
has a greater potential for extracting band parameter information 
from the spectrum. 

The precision with which each parameter is estimated depends on 

(l6) 

the appearance of the spectrum. This has been discussed by Niple 

for the case of single lines* It is clearly desirable to maximize the 

spectral resolution and signal to noise ratio. Many methods of improving 

the appearance of spectra, degraded by noise or finite resolution, by 

smoothing or partial deconvolution have been proposed. These may have 

value if no model of the "true" spectrum is available, and they are 

often considered useful to apply before the results are analyzed by 

conventional methods. However, if parameter values are to be estimated 

( 17 ) 

by fitting- the data to a model, Kullback and others have shown that 
manipulation of the experimental data before analysis cannot add to its 
information content and tends to remove information. In our method no prior 
manipulation of the data is required, and the non linear least squares 
method of analysis is generally accepted as giving the best estimates 
of the desired parameters when a non linear model must be used. 

The appearance of. the spectrum is also determined fay the physical 
conditions (temperature, pressure and absorber amount) of the sample. 

We have not explored these relationships although the results obtained 
suggest there is no unique set of sample conditions which will allow 
all the parameters to be estimated with optimum precision. Thus the 
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analysis of several spectra may be required. The present method can be 
modified to allow several spectra to be analyzed simultaneously^ or to 
allow weighted parameter values from other sources to be included in 
the analysis to obtain single best estimates of the parameters from 
the entire data set. 

A single spectrum may contain several bands of a molecule. If 
the band models contain common parameter values - for example if the bands 
have common upper or lower states - improved estimates can be obtained 
by simultaneous analysis of all of these bands appearing in different 
spectra. The ability to retrieve information is thus limited only by 
the quality of the experimental data and computer costs. 

We have not explored the strategies to be used to analyze an 
unknown spectrum. The spectral examples discussed here are of molecules 
whose properties are well known and good initial guesses of the 
parameter values can be made. In other cases, the appropriate band 
model may not be known, or the spectrum may consist of incompletely 
resolved lines due either to insufficient spectral resolution or to the 
close spacing of the lines. 

Thus, for example, if only very low resolution spectra of the 1-0 
bands of CO showing unresolved P and R branches were available for 
analysis, only a few of the adjustable parameters could be obtainedi* 

In this case, the contributions by the weaker isotopes would be 
neglected, the lines would be assumed to have some width independent 
of m, and only the background parameter a, the band center, the 
average B value, and Jr*( 0)|^ estimated. These values could then be 
used as initial guesses to analyze more highly resolved spectrum. 
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Figure Captions 


Fig. 1. 


Fig. 2. 


- The experimental CO spectrum analyzed is shown in the top 
curve, beneath it is the spectrum calculated from the 
retrieved values in column 4, Table 5* The other curves 
show the ratios and differences of these two spectra. 


- The continuous curve represents the polynomial dependence of 
the CO line widths on m calculated from the retrieved 
values shown in column 4 , Table 5. The open circles 
represent the average half widths obtained from Benedict, 
Herman and Silverman and the asterisks show the half widths 
in the latest version of the AFGL atmospheric line listing. 
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TABLE 1. Sample conditions and spectral characteristics 


Spectrum 

1 

2 

3 

4 

Absorber 

HC1 

HC1 

HC1 

CO 

£ 2^ i 

3.065 

3.065 

3.065 

89.25 

Absorber amt. 
atm. cm-(296K) 

0.421 

0.421 

0.421 

0.350 

Partial Pressure 

700 

700 

50 

700 

of N 2 (Torr) 





Sample Temp (K) 

296 

296 

296 

296 

Path Length (m) 

171 

171 

171 

171 

Spectral Limits 
Upper (cm - l) 

3050 

3050 

3050 

2235 

Lower (cm~l) 

2671 

2671 

2671 

2025 

Spacing of data 
points (cm - l) 

0.0603 

0.0603 

0.0603 

0.0537 

No. of points 

3611 

3611 

3611 

3920 

analyzed 





v (cm - l) 

2877 

2877 

2877 

2130 





TABLE 2. HC1 Spectra: estimated parameter values 



Input Values 


Retrieved 


Retrieved 


10 8 b cm 
10 8 c cm 2 
10R cm“l 

io 5 v 1 

i0 4 V 2 

io 2 ei 

10 % 

u i cm - l 
b l 

Bj cm - 1 
10 4 D^ cm~l 
10 8 Hj cm -1 


10 4 D^ cmT^ 
10%" cm -1 
Vb2 cm_1 

I 

B 2 cm 1 
10 4 D 2 cm~l 
10 8 H 2 cm" 1 
B 2 cm -1 
10 4 D 2 cm'l 
10 8 H 2 " curl 
10 ao cm-l 
10 4 u^ cm~l 
10 8 a 2 cm“l 

10 8 ctg cm - ^- 

i , nm 1 


5.2740 

0 

0 

1.04 

7.604 

2.349 

-2.60 

4.50 

2883.8705 

10. 12090 

5.200 

1.63 

10.42430 

5.270 

1.63 

2885.9775 

10.136223 
5.2167 
1.65 

10.440254 
5.2835 
1.65 
1.0497 
3.84 
-1.568 

-5.76 

+6.64 


5.2736 

1 

-2 

1.04 

7.56 



2.35 

(2) 

■2.5 

(1) 

5 

(3) 

2883.870 

+.0003 


10.12090 

(9) 

5.21 

(2) 

2 

(1) 

10.42417 

(9) 

5.25 

(2) 

1 

(1) 

2885.979 

+.0003 


10.13615 

(?) 

5.20 

(1) 

0 

(1) 

10.44032 

(5) 

5.290 

(8) 

1.8 

(7) 

U049 

(1) 


5.276 

-48 

80 

1.15 

6.68 

2.00 

-3.16 

-16 

2883.870 

10.12115 

5.242 

3.9 

10.42485 

5.350 

3.9 

2885.980 

10.13601 

5.203 

0.2 

10.44064 

5.325 

0.2 

1.06 

5.55 

-1.748 


44.09 (3) 

-2.3 x 10 3 (2) 
4 x 10 2 (2) 

1.19 (2) 


2883.870 

±0.00004 

10.12134 (7) 

5'. 241 (3) 

3.9008 (3) 

10.42458 (7) 
5.328 (3) 

3.9009 (3) 

2885.980 

±.00004 

10.13614 (4) 

5.203 (3) 

0.6 ( 6 ) 

10.44032 (2) 

5.305 (4) 

0.6 ( 6 ) 

1.4 (1) 

50 (30) 

-2 x 10“ 3 
(bound) 

-18 (19) 

11 ( 1 ) 












TABLE 3. Some intermediate parameter values obtained during the analysis of a synthetic HC1 spectrum 


Parameter 

Res. Sum. Squares 

a 

R(cffl-l) 

10 2 Bj 

10 4 Di" (curl) 

B 2 (ctd"-1) 

10a o (cm"l) 

Initial Guess 


5.2740 

0.10400 

-2.60 

5.270 

10.13600 

1.050 

Iteration Number 








1 

22.827 

5.2740 

0. 10363 

-2.62 

5.272 

10.13597 

1.162 

5 

13.330 

5.2739 

0.10458 

-2.60 

5.274 

10.13621 

1.059 

10 

10.064 

5.2746 

0. 10206 

-2.52 

5.247 

10.13621 

1.053 

15 

9.931 

5.2737 

0. 10388 

-2.46 

5.249 

10.13614 

1.050 

16 

9.929 

5.2736 

0. 10391 

-2.46 

5.252 

10.13615 

1.049 














TABLE 4. Estimate of the asymptotic correlation matrix obtained from the analysis 
of a synthetic HC1 spectrum (all values multiplied by 1000) 


Parameter 

a 

b 

c 

R 

V 1 

v 2 

Bl 

b 2 

v bl 

B 1 

a 

1000 










b 

435 

1000 









c 

-529 

-334 

1000 








R 

-322 

-153 

166 

1000 







V 1 

-474 

-437 

321 

472 

1000 






V 2 

-520 

-423 

342 

602 

866 

1000 





h 

-31 

-15 

16 

341 

284 

286 

1000 




h 

-184 

158 

430 

148 

-73 

100 

-56 

1000 



v bi 

513 

-441 

-172 

23 

112 

59 

63 

-354 

1000 


b; 

223 

260 

397 

-34 

74 

48 

70 

298 

-36 

1000 

K 

447 

430 

281 

-109 

14 

-17 

56 

326 

105 

846 

K 

582 

504 

79 

-154 

-81 

-108 

74 

239 

201 

600 

Bl 

77 

-105 

680 

34 

122 

103 

66 

285 

144 

842 

h 

-33 

-170 

857 

47 

163 

158 

16 

398 

166 

584 


-66 

-212 

806 

20 

94 

97 

40 

384 

142 

334 

^2 

-232 

6 

834 

132 

61 

93 

47 

525 

-181 

458 

B2 

-127 

-553 

202 

-13 

378 

332 

-29 

-250 

376 

27 

d 2 

-131 

-799 

234 

53 

477 

416 

25 

-295 

687 

-73 

H 2 

-109 

-740 

208 

75 

433 

378 

62 

-223 

677 

-93 


-337 

-26 

467 

19 

184 

196 

-50 

219 

—366 

282 

* ^ 
d 2 

-492 

373 

504 

79 

-50 

7 

-54 

535 

-884 

292 

H 2 

-294 

457 

236 

13 

-215 

-163 

-50 

462 

-793 

166 

«o 

463 

465 

-134 

-671 

-770 

-900 

-240 

-17 

-142 

129 

a l 

31 

14 

-15 

-299 

-229 

-250 

-859 

-35 

-52 

-64 

a 2 

-247 

-630 

-221 

196 

453 

402 

66 

-566 

430 

-335 

a 3 

-11 

-63 

-57 

228 

158 

184 

595 

-70 

101 

-19 

a 4 

86 

497 

351 

-72 

-110 

-99 

96 

449 

-416 

371 


V 



TABLE 4 (cont) 


Parameter 

D 1 

H 1 

* 

B 1 


, H 1 

v b2 

B2 

D 2 

«2 

B2 

D i 

1000 










Hi 

912 

1000 









B f 

649 

410 

1000 









610 

463 

834 

1000 







H 1 

444 

398 

614 

919 

1000 





* 

v b2 

394 

233 

682 

827 

785 

1000 





B 2 

64 

46 

138 

200 

163 

-313 

1000 




D2 

-86 

-110 

183 

249 

. 225 

-78 

677 

1000 



h 2 

-127 

-151 

176 

231 

217 

65 

366 

916 

1000 


b 2 

253 

144 

Ilk 

342 

287 

101 

547 

-94 

-343 

1000 

D 2 

134 

-35 

228 

249 

218 

552 

-379 

-642 

-609 

466 

a * 

H 2 

30 

-85 

72 

47 

58 

473 

-670 

-695 

-525 

-7 

°o 

219 

264 

73 

67 

108 

134 

-361 

-449 

-392 

-83 

a l 

-61 

-81 

-49 

-11 

-32 

-38 

20 

-28 

-59 

44 

a 2 

-447 

-437 

-269 

-370 

-395 

-579 

601 

738 

616 

-99 

a 3 

-22 

14 

-32 

-73 

-54 

-80 

61 

93 

90 

-55 

a 4 

440 

381 

345 

450 

451 

639 

-510 

-587 

-463 

143 




TABLE 4 (cont) 


Parameter 

/ <* 
»2 

H2 

a o 

a l 

«2 

a 3 

a 4 

s a 
D 2 

1000 







-* > 
H 2 

836 

1000 






<*o 

165 

285 

1000'' 





“1 

52 

49 

215 

1000 




a 2 

-615 

-647 

-578 

-52 

1000 



“3 

-150 

-147 

-204 

-877 

158 

1000 


“4 

642 

632 

336 

-29 

-867 

-177 

1000 



TABLE 5. CO Spectrum. Estimated Parameter Values 


S ' s '\ Retrieved 

Parameter''*'^ 

1 

2 

3 

4 

Calculated 

1 

a 

53.07 

( 2 ) 

53.11 

( 2 ) 

53.11 

(D 

53.095 


2 

10 ^b cm 

O.Of 


-1 

( 2 ) 

O.Of 


O.Of 


3 

10 ^c cm 2 

O.Of 


-1 

(4) 

O.Of 


O.Of 


4 

10R cm -1 

1 . 040f 


0.69 

(D 

0.69 

U) 

0.689 


5 

io 6 v 1 

1.30 

( 2 ) 

1.22 

(4) 

1.22 

(4) 

1.215 

1.227 

6 

io*v 2 

1.25 

(3) 

1.14 

(3) 

1.14 

(3) 

1.121 

1.143 

7 

10 4 $ 1 

-2 . 80f 


-3 

( 2 ) 

- 2 . 8 f 


- 2 . 8 f 

- 2.8 

8 

10 % 

7.50f 


7.5f 


7.5f 


7 . 5f 

7.5 

9 

v j cm " 1 

2096.0686 

( 8 ) 

2096.06 

(D 

2096.06 

(D 

2096.058 

2096.0670 

10 

Bj cm x 

1.8216 

(!) 

! 1.822 

(3) 

1.822 

(3) 

1.8217 

1.82161 

11 

10 6 dJ cm -1 

6.2 

(3) 

6.05 

(4) 

6.04 

( 6 ) 

5.922 

5.594 

12 

i° 12 hI 

5. Of 


5. Of 


5. Of 


5. Of 

5.0 

13 

cm 1 

1.83809 

(9) 

1.838 

(3) 

1.838 

(3) 

1.8380 

1.83797 

14 

io 6 dJ' 

6.5 

(2) 

5.58 

(4) 

5.58 

(7) 

5.478 

5.596 

15 

10 12 %" cm " 1 

5. Of 


5. Of 


5. Of 


5. Of 

5.0 

16 

v 2 cm -1 

2143.2696 

(4) 

2143.269 

(!) 

2143.2686 

CD 

2143.26848 

2143.2715 

17 

1 

B 2 cm"- 1 - 

1.905045 

( 8 ) 

1.9050 

(3) 

1.904984 

(5) 

1.904990 

1.905026 

18 

IO 6 D 2 

6.103 

( 1 ) 

6.039 

(3) 

6.039 

(3) 

6.0400 

6.1195 

19 

10 12 H 2 cm - ^ 

5.44f 


6 

(9) 

5.44f 


5.44f 

5.44 

20 

I 

B 2 cm " 1 

1.922575 

(9) 

1.9225 

(3) 

1.922504 

(5) 

1.922510 

1.922529 

21 

10 6 D 2 ^ cm " 1 

6.142 

(3) 

6.066 

(3) 

6.066 

(3) 

6.0673 

6.1202 

22 

10 1 ^H 2 ’ > cm " 1 

5. 60f 


6 

(9) 

5. 60f 


5.60f 

5.60 

23 

10 2 a o cm " 1 

8.1 

( 2 ) 

8.7 

( 2 ) 

8.7 

( 2 ) 

8.73 

8.18 

24 

10 ^a^ cm -1 

-4.7 

( 2 ) 

-4.8 

( 1 ) 

-4.8 

(D 

-4.75 

-5.06 

25 

10 4 a 2 cm -1 

2.6 

(i) 

3.0 

( 1 ) 

3.0 

CD 

2.91 

2.91 

26 

106«3 cm -1 

-5.9 

(3) 

-7.2 

(3) 

-7.2 

(3) 

-7.01 

-5.84 


Ref. 


12 

12 

11 

11 

11 

11 

11 

11 

11 

10 

10 

10 

10 

10 

10 

10 

14 

14 

14 

14 
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ABSTRACT 


Th e retr ieval of the Lorentzian width, intensity, and position of 
Voigt shaped lines obtained with an instrument of finite spectral reso- 
lution by spectral curve fitting and the use of non linear least squares 
techniques is described. The accuracy with which the values of these 

parameter s c an be re trieved was tested by analyzing synthetic spectra 

to which random noise was added. Some of the requirements of the experi- 
mental data to obtain good retrievals are identified. 
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Introduction 


A knowledge of the positions, shapes, and intensities of spectral 
lines is a prerequisite for the quantitative analysis of line spectra. 

Line position measurements are often made independently from estimates 
of the line intensity and halfwidth. These latter quantities are usually 
determined from curve of growth measurement s of the equivalent widths of 
lines obtained from several spectra. 

In a recent series of papers Chang et al 1 ’ 2 ? 3 have discussed the 
retrieval of information from spectra by curve fitting using non linear 
least squares techniques. It has been shown that all the line parameters 
can be obtained from the analysis of a single spectrum provided the phys- 
ical conditions of the sample -pressure, temperature, absorber concentra- 
tion, and path- length, are known, and that suitable mbdels"'fcr the ‘shapes 
of the lines and the instrument spectral response function are used. 
Conversely, if the line parameters are known, the physical conditions of 
the sample can be estimated. 

In this previous work a Lorentzian line shape was assumed. This 
describes the line shape adequately for samples near ambient temperature 
and atmospheric pressure but it becomes necessary to use the mixed 
Doppler- Lor entz (Voigt) profile as the sample pressure is reduced. Thus, 
although the Lorentzian shape can be used to predict the infrared trans- 
mittance in the troposphere, the Voigt shape is required for stratospheric 
or mesospheric calculations. 

In this paper it is shown that the curve fitting technique can be 
extended to allow Voigt shaped lines to be analyzed. The retrieval of the 
line parameters from an absorption spectrum, consisting of either a single 
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isolated line or double lines, obtained with an instrument of finite res- 
olution is discussed. By analyzing synthetic spectra to which random 
noise has been added it is shown that, provided the Doppler width of the 
line and the spectral response function shape are known, the line position 
and intensity, the Lorentz half width, the spectral resolution, and the 
position of the background curve in the absence of line absorption can be 
retrieved from spectra taken with a wide variety of experimental condi- 
tions. In addition, experimental data of the P(l) and P(9) pairs of iso- 
topic lines in the 1-0 band of HC1 have been analyzed by assuming them to 
have both a Lorentz shape and a Voigt shape. 

The principal computer programs used in this work were developed by 
the Health Sciences Computing facility at the University of California and 
are described in Ref. (4) . Both single and double precision versions have 
been used to verify the accuracy of the computations. 

Method 

If a homogeneous absorbing gas sample has a monochromatic absorption 
coefficient k(v) at frequency v, the transmitted signal I m (v) is related 
to the signal Io(v) , obtained with the absorber removed, by 

ImO) = Io(v) exp[-k(v) u] , (1) 

where u is a measure of the amount of absorber present, 

P . T 

u = -JJL . _o_ . j(atm-cm,Po,To) , (2) 

“o abs 

where the sample temperature is T , , the partial pressure of the absorber 

3*0 S 

is P , the absorbing path length is j?(cm), and To and Po refer to some 
standard temperature and pressure, respectively. All of the quantities 
defining u are assumed known. 
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J-f "the spectrum is obtained with an instrument of finite resoiving 
power then the observed signal can be written as 

Tfv'l - / I m(v') cr(y)V’) dv ’ 

/ ff(v,v>) dv * a 

where cr( v,v 1 ) describes the instrument spectral response function. A 
triangular shape has been used in this and previous work. 

We have assumed: 

1. cr( v, v ' ) is triangular whose full width at half -height H is to be 
retrieved in the analysis. 

2. Io(v) =Ai +Aa(v - vo), where Ai and As are constants to be 
determined, and vo is chosen to lie near the middle of the spec- 
tral region analyzed. In the analysis of synthetic spectra de- 
scribed here vo = 0.7320 cm -1 . 

3. Values of l(v) are obtained at frequency intervals Av. 

4. k(v) is described by the Voigt line shape. 

The Voigt line shape can be described in terms of the -Voigt function 


k(x ; y) . SM .If 

ko * J-o 


exp(-t 2 ) 
y 2 + (x - t ) 2 


where k(v) is the absorption coefficient, 
, S (Sn 2\' /a 


_ _S_ / Sn 2y /a 
ap \ * / 5 


y = ^ 2)» , 

cx D 


V - Vq 


(in 2)* , 


S = f k(v) dv , is the line intensity. 
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= Lorentz half width due to radiation damping or collisional 
broadening, 

Vo / 2kT 2m 2 V 6 

Op = ~ l — ) , is the Doppler half -width, 

and 

Vo = line center position. 

The Voigt function cannot be evaluated exactly but a number of tech- 

. niques_for calculating it s approximate value have been described . 5 > e > 7 > 8 

A fast, _calculational method for obtaining its value to about one part in 

10 4 has been described by Pierluissi et al. 9 Three different approxima- 
tions are, used which, depend on_the values of x and y. This program has 
been_modified to include the case of y = 0 in the "region 

1(0 § 1.8, .0 S _y g 3.0) and to reach higher precision for the case of 

pure Doppler lines. 

In order to calculate a spectrum of a Voigt line it is necessary to 
.. know the ,v allies o_f _ the quantities Ap_, As.',' H,” S, u,“ ' v c ,‘ a5~and ofi ' " Since ~ 
the Doppler width is not strongly dependent on temperature and since the 
molecular mass of the absorbing gas is usually known we have assumed the 
ap can be calculated. Thus, up to six other parameter values must he de- 
termined by spectral curve fitting. 

In the technique described by Chang et al, 1 ? 2 * 3 a calculated spec- 
trum, based on initial guesses of the unknown parameters, is compared with 
the observed spectrum. The parameter values are then varied until the 
best fit (defined in the usual manner, that the sum of the squares of the 
differences between the observed and calculated spectrum is a mininum) is 
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obtained. There are a number of computational methods for arriving at 
this best fit, we have used the programs BMDP3R and BMDPAR described in 
Ref. (4). 

The subroutines for analyzing Voigt shaped lines for use with these 
programs are given in Appendices I and II. The subroutine in Appendix I 
is designed to be used with BMDP3R for the simultaneous retrieval of the 
parameters of two absorption lines. Appendix II gives the corresponding 
subroutine for a single line for use with BMDPAR. Based on the initial 
parameter guesses, values for x and y are calculated for each position in 
the spectrum. The corresponding value of the Voigt function is evaluated 
by the method of Pierluissi et al 9 and this is used together with 
Eqs. (1-4) to calculate the transmitted signal l(v). The subroutine for 
use with BMDP3R also gives analytical expressions for the partial deriva- 
tives of Eq. (3) with respect to each of the unknown parameters required 
for the curve fitting program. 

The Voigt function k(x,y) used in the three different regions pro- 
posed by Pierluissi et al is given by the real part of w(z), where w(z) 
is provided by Abramowitz and Stegun 10 as the following; 

in region I (OS |x| < 3, 0 S y < 1.8) 

, _„2 

w(z) = e [l - erf(-iz)] , 

where z = x + iy, 



erf (-iz) 


= JL y 


(-D 


n 


n! (2n + 1) 


(-iz) 


2n+l 


(6) 


and n = 15 if x = 0, otherwise n = 6.842x +8.0. The number of terms to 


be included was determined through an empirical relation in terms of x 
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(7) 


and y proposed by Pierluissi et al, 9 
in region II (3 £ jx| < 5, 1.8 ^ y < 5),‘ 

.4613 ^ O.Q9999 


w(z) 


. / 0.1 
= 1Z (— 


0.00288 


0.1901 ' z 2 - 1.7844 ' z 2 - 5.5253/ * 
and in region III (jx| § 5, y ^ 5), 


w(z) = iz 


0.5124 


0.05176 \ 


v z 2 - 0.2752 ‘ z 2 - 2.7247 / * 
The derivatives of w(z) are, 
in region I, 


(8) 


dw(z) 

dz 


(-2z) 


w(z) 



(9) 


in region II, 


dw(z) 


dz 




0.4613 + 0.09999 + 0.0Q288 \ 

z 2 - 0.1901 z 2 - 1.7844 z 2 - 5.5253/ 


- i(2z 2 ) 


t 


0.4613 


0.09999 


+ 


0.00288 


(z 2 - 0.1901) 2 (z 2 - 1.7844) 2 (z 2 - 5.5253) 2 


■) 


and in region III, 


dw(z) 

dz 


= i / 0.5124 + 0.05176 ^ 

\z 2 - 0. 


5124 

2752 ' z 2 - 2.7247 ) 


- i(2z 2 ) 


fc 


0.5124 


0.05176 


-) 


( 10 ) 


t (z 2 - 0.2752) 2 (z 2 - 2.7247) a 

Equations (5-11) are used to obtain the derivatives of Eq. (3) with 


( 11 ) 


respect to each unknown parameter. It should be noted that the value of 
x defined in Eq. (4) may take either positive or negative values but the 
values used in the expressions for the Voigt function given by Pierluissi 


et al are always positive or zero. Thus, although the absolute values of 


x are used to obtain the absorption coefficient k(x,y), the correct sign 


must be assigned to the partial derivatives required in the analytical 


program shown in Appendix I. 


C - 2 . 


6 




Results 


The accuracy of the modified method described by Pierluissi et al 
for calculating k(x,y) was tested by comparing values for this quantity 
for wide ranges of x and y values with the tabulation of k(x,y) given by 
Young. 5 The agreement was well within the accuracy of one part in 10 4 . 

The analytical program of BMDP3R requires partial derivatives of 
Eq. (3) for each unknown parameter for each of the three approximations 
to the Voigt function. Tests of the accuracy of these derivative expres- 
sions were performed by calculating the transmittances (to six significant) 
figures) of a single Voigt shaped line, for the parameter values in 
Table 1. This table also shows the total number of data points calculated 
in each spectrum, their spacing Av, and the number of data points for 
which the transmittance was less than 95%. These parameter values de- 
scribe lines in each of the three regions defined by Pierluissi et al. 9 
Values of the line parameters and other quantities were then retrieved by 
using program BMDP3R in single precision by considering these calculated 
transmittances to be "experimental" data. The initial guesses for the 
line parameters were usually within 30 % of the true values. In a.l 1 cases 
the values retrieved hy the program agreed with the initial values in 
Table 1 to tetter than one part in 10 4 after the number of iterations 
shown. 

These tests confirm the accuracy of the programs hut they do not 
establish their ability to retrieve information from actual spectra. 

Since our present spectrometers have insufficient resolution to obtain 
suitable spectra of lines in the regions I and II defined hy Pierluissi 
et al, it was necessary to use synthetic data. A series of line 
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profiles was calculated, for several sets of values of the adjustable pa- 
rameters and random Gaussian noise was added to these profiles before 
analysis to give a signal- to-noise ratio of 100. Some of the results are 
shown in Table 2. 

This table shows the parameter values used to generate each spectrum, 

the total number of data points analyzed, the number of data points for 

which the transmittance was less than 0.95? the spacing between data 

\ 

points, and the number of iterations carried out by the program. The re- 
trieved values together with the standard deviations obtained by the pro- 
gram are also shown, where, in all cases it was assumed that the Doppler 
width was known. In some of the retrievals the background parameter A 2 
was constrained to lie between ± 0.001 cm; in other cases the upper and 
lower bounds were increased. 

It is seen uhat essentially all the retrieved values agree with the 
initial values within two standard deviations. In some cases the standard 
deviations are large, either because the number of data points containing 
useful information was small or because the ratios of the spectral resolu- 
tion or Lorentz width to the Doppler width of the line were too small. 

These results were obtained by using the single precision version of 
the BMDP3R source program. It is obvious that this precision is adequate 
for the present work. New versions of the BMDP programs 4 which use 
double precision are now available. In addition to the double precision 
version of BMDP3R there is a new program BMDPAR which does not require 
the partial derivatives of the function with respect to each parameter. 
This greatly reduces the subroutine programming. We have compared results 
obtained by using the program BMDPAR and also the single precision and 
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double precision versions of BMDP3R both for Lorentz lines and Voigt lines. 
In analyzing synthetic spectra of Voigt lines, we have found no signifi- 
cant differences in the values of the parameters retrieved by the single 
and double precision versions of BMDP3R. In some cases the number of it- 
erations required by the double precision version was less than that for 
the single precision version and there was a small decrease in the comput- 
er time required. 

Experimental spectra of the P(l) and P(9) pairs of isotopic lines of 
the 1-0 band of HC1 have also been analyzed. The spectra were obtained 
from a sample containing 0.421 atm cm (296 K) of HC1 with U 2 added to give 
a total pressure of 700 Torr in a 171 m path at 296 K. In a previous anal- 
ysis 11 of these spectra the lines were assumed to have a Lorentz shape and 
the single precision version of BMDP3R was used to retrieve the line pa- 
rameters. Analyses of many pairs of lines in the band suggested the best 
value of the spectral resolution was 0.104 cm -1 . The Voigt expression ap- 
proaches the Lorentz expression for large x and y values and the spectra 
have been reanalyzed by using the double precision versions of BMDP3R and 
BMDPAR. The results of several analyses of the P(l) and P(9) spectra are 
shown in Table 3 and Table 4 respectively. In some of the retrievals, the 
spectral resolution and the line intensity ratio were constrained as de- 
scribed in our previous analysis of these spectra. 11 The different analy- 
ses in each of these tables give essentially the same results and confirm 
that all the lines analyzed can be described adequately by the Lorentz 
shape. 

The spectra of the P(l) lines have also been analyzed by using the 
derivative free BMDPAR program. Both Voigt and Lorentz line shapes have 
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been used. A comparison of tbe results obtained by using both BMDP3R 
and BMDPAR is given in Table 5. The results are essentially identical. 

It has been found that the BMDPAR program is the more expensive and that 
the initial values should be as close as possible to the best values for 
rapid convergence. 

Examples of some of the data analyzed are shown in Pigs. 1-4. The 
three lines in Fig. 1 have a pure Doppler shape, mixed Doppler-Lorentz 
shape, and a pure Lorentz shape, and their corresponding k(x,y) values 
lie entirely in the regions I, II, and III defined by Pierluissi et al. 

The parameters used to obtain these lines are given in the first three 
columns of Table 1. 

Figures 2 and 3 show synthetic spectra with simulated 1% Gaussian 
random noise, the spectra calculated from the retrieved parameters, and 
the ratio and difference of the spectra which correspond to experiments 
2a and 3 respectively in Table 2. 

Figure 4 shows the experimental spectrum of the P(9) lines of HG1, 
the spectrum calculated from the constants obtained by assuming a Voigt 
profile and by using BMDPAR to analyze the data together with the ratio 
and difference of the observed and calculated spectra. Figure 5 shows the 
ranges of the values of the Voigt parameters x and y which correspond to 
the line parameters given in Tables 1-5. 

Discussion 

Absorption spectra of Voigt shaped lines obtained with an instrument 
of finite resolution must be modeled as complex convolutions of several 
analytical functions. These functions contain a small number of adjust- 
able parameters. It has been shown that the values of these parameters 


10 



can often be retrieved with reasonable accuracy by spectral curve fitting 
of a limited amount of data from a single spectrum provided: 

1. the frequency spacing between the experimental data points is 
comparable to, or smaller than, the widths of the lines, 

2. the spectral resolution is comparable to the line width, 

3. if the Lorentz width is to be determined, it should not be 
smaller than the Doppler width. 

The synthetic spectra analyzed in this work correspond to the types 
of spectra obtained, for example, by high resolution Fourier Transform 
Spectrometers and the analysis was restricted to the retrieval of line 
parameters. Spectra with Voigt shaped lines can be collected by other 
xypes of instruments either for the purpose of obtaining information about 
line parameters or of the physical characteristics of the absorbing me- 
dium. Chang et al 1 have explored applications of spectral curve 
fitting to retrieving information from spectrum consisting of Lorentz 
shaped lines. The present work shows that these same techniques can now 
be used to analyze spectra with Voigt shaped lines. 

In the process of exploring these techniques, we have attempted to 
determine, within the limitations imposed either by the available instru- 
mentation or the nature of the absorbing sample, the amount of information 
contained in the spectrum and methods of optimizing the desired informa- 
tion content in the data collected. Thus, for example, if it is desired 
to measure the Lorentz width and intensity of an isolated spectr al line, 
it is desirable to maximize the spectral resolution, the number of data 
points collected, and the signal-to-noise ratio in the spectrum, and to 
minimize the spacing between data points. Also the sample pressure should 
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be increased to make the Lorentz width large compared with the Doppler 
width. If these conditions are met, then the line will have a Lorentz 
shape and can be analyzed by the method of Chang et al . 1 ? 2 ? 3 However, 
the infrared spectra of many molecules consist of closely spaced lines 
and the sample pressure must be reduced to narrow the lines and minimize 
the, amount of overlapping. Under these circumstances, the line may have 
a Voigt shape. Although spectra of finite resolution have been discussed, 
it is now possible to obtain spectra with essentially infinite resolution. 
In these cases, the true line profile is obtained and the spectral curve 
fitting technique will often allow the simultaneous dete rmin ation of both 
the Lorentz and Doppler line widths, if desired. 

The ability to analyze Voigt shaped lines will also allow, for exam- 
ple, solar spectra observed above the tropopause to be analyzed. The tel- 
luric lines in these spectra due to gases in the stratosphere and meso- 
sphere have Voigt shapes and contain information about the vertical 
distributions of the absorbing gases and the pressure and temperature 
profiles of the atmosphere. These least squares techniques will allow 
sets of spectra to be analyzed separately or s imul taneously, to retrieve 
the physical characteristics of the atmosphere by assuming the line param- 
eters are known, and an appropriate model atmosphere. 
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Table 1. Values of the parameters used ro generate synthetic specura to test 
the accuracy of the computer programs 


Voigt 

Region 

I 

II 

III 

I, II, 
and III 
Combined 

I, 11, 

and III 
Combined 

a D ( cm.' 1 ) 

0.027506 

0.0027506 

0.0027506 

0.0027506 

0.0027506 

a L (cm'* 1 ) 

0.000132 

0.0070 

0.03304 

0.005616 

0.002632 ' 

SU/it (cm -1 ) 

0.012046 

0. 004015 

0.037477 

0.008031 

0.010708 

v 0 (cm -1 ) 

0.7320 

0.7320 

0.7320 

0.7320 

0.7320 

H (cm -1 ) 

0.00060 

0.00080 

0.0500 

0.008001 

0.00200 

Ai 

5.274 

5.274 

5.274 

5.274 

5.274 

A s (cm) 

0.000 

0.000 

0.000 

0.000 

0.000 

Av (cm -1 ) 

0.00050 

0.00020 

0.015 

o.ooio847 

0.001241 

Total number 
of points 

240 

356 

90 

100 

200 

Humber of points 
for T < 0.95 

200 

273 

21 

100 

93 

Number of 
iterations 

32 

25 

1 

11 

45 

15 
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Table 2, Initial values of the par curie lei s used to calculate absorption spectra of single Voigt shaped lines. The parameter values and their standard de- 
viations retrieved by using the single piecislon version of the program BMDP3R are also given. 


Exp. 

Data 

l° 3 an 

AO'H 

10 3 su/it 

Vo \ 

Ar 


10 3 H 

Total 

Ho. Of Pts. 

10 3 Av 

Mo of 

(cm- 1 ) 

(cm -1 ) 

(cm -1 ) 

(cm -1 ) 

(cm) 

(cm" 1 ) 

No. Pts. 

Trans < 0.95 

(cm" 1 ) 

Iterations 


Initial Value 

2.75 

5.50 

„ 4.015 

0.7320 

5.274 

0 

5.50 





1 

Retrieved 

Fixed 

6.53 

4.25 

0.7318 

5.295 

-0.001 

2.6 

50 

20 

2.20 

9 


lo 

- 

0.56 

0 11 

0.0001 

0.015 

“ 

4.6 






Initial Value 

2.75 

5.50 

e.031 

0.7320 

5.274 

0 

16.5 





2a 

Retrieved 

Fixed 

6.68 

8.32 

0.7318 

5.317 

-0.00 

15.5 

50 

28 

2.20 

7 


la 

- 

0.71 

0.12 

0.0001 

0.023 

0.22 

1,2 






Initial Value 

2.75 

5.50 

8 031 

0,7320 

5.274 

0 

16.5 





2b 

Retrieved 

Fixed 

5.32 

8.09 

0.7321 

5.267 

+0.001 

16.7 

100 

28 

2.20 

11 


la 

“ 

0.49 

0.10 

0.0001 

0.009 

- 

0.9 






Initial Value 

h.oo 

8.00 

8.031 

0.7320 

5.274 

0 

12.0 





3 

Retrieved 

Fixed 

8.10 

8.13 

0.7324 

5.269 

0.00 

11.2 

100 

14 

5.00 

5 


la 

■* 

0.64 

0.13 

0.0002 

0.007 

0.03 

2.3 






Initial Value 

4.oo 

12.0 

12.05 

0.7320 

5.274 

0 

20,0 





1( 

Retrieved 

Fixed 

11.9 

12.15 

0.7324 

5.268 

0.02 

20.0 

100 

21 

5.00 

7 


la 

- 

0.9 

0.16 

0.0002 

0.008 

0.03 

0.2 






Initial Value 

3.00 

12.0 

12.05 

0.7320 

5.274 

0 

24.0 





5 

Retrieved 

Fixed 

11.4 

12.1 

0.7315 

5.270 

0.00 

24.3 

200 

16 

6.00 

8 


la 


o.9 

0.2 

0.0002 

0.004 

0.01 

o.s 

! 






Initial Value 

27.5 

0.132 

12.05 

0.7320 

5.274 

0 

0.606 





6 

Retrieved 

Fixed 

0.01 

12.02 

0.7320 

5.271 1 

0.00 

3.6 

240 

210 

7.60 

2h 


1(7 

” 

0.50 

0.10 

0.0000 

0.008 

0.03 

10.0 






Table 3. Comparison of HC1 P{1) line parameters retrieved by U3ing the single and double precision versions of BMDP3R for Lorentz and Voigt shapes 


Exp. 

Iterations 

Frecioion 

Line 

Shape 

Constraints 

Residual 

Ai 

Aa 

(cm) 

Si.tr/rc 

(cm -1 ) 

S 2 U/rt 

(cm -1 ) 

q l § 

(cm -1 ) 

Vi 

(cm -1 ) 

1’2 

(ora -1 ) 

1 

II 

( cm -1 ) 

A 

9 

Double 

! 

Voigt 

Sp/Si* = 3.0106 
II - 0.104 

2.84265 

5.275 

±0.005 

0.0006 

±0.0007 

0.187 

±0.003 

0.577 

±0.010 

0.091 

±0.002 

2863.014 

±0.002 

2865.093 

±0.002 

o.io4 

D 

12 

Double 

Voigt 

Hone 

2.8390 

5.276 

±0,005 

0.0006 

±0.0007 

0.165 

±0.003 

0.559 

±0.027 

0.095 

±0.006 

2063.014 

±0.002 

2065.093 

±0.002 

0.089 

±0.003 

c 

9 

Double 

Lorentz 

Ss/Sj. = 3.0186 
II « 0.104 

2.8424 

5.275 

±0.005 

0.0006 

±0.0007 

0.187 

±0.003 

0.577 

±0.010 

0.091 

±0,002 

2863.014 

±0.002 

2865.093 

±0.002 

0.104 

D 

12 

Double 

Lorentz 

Hone 

2.8390 

5.276 

±0.005 

0.0006 

±0.0007 

0.185 

±0.034 

0.559 

±0,027 

0.095 

±0.006 

2863.014 

±0.002 

2865.093 

±0.002 

0.089 

10.003 

E 

26 

Single 

Voigt 

Ss / S x = 3.0186 
H = o.io4 

2.8425 

5.275 

±0.005 

0.0006 

±0.0007 

0.137 

±0.003 

0.577 

±0.010 

0.091 

±0.002 

2863.014 

±0.002 

2865.093 

±0.002 

0.104 

F 

i 

18 

i 

Single 

Voigt 

Hone 

2.8392 

5.276 

±0.005 

0.0006 

±0.0007 

0.185 

±o.oo4 

0.559 

±0.027 

0.095 

±0,006 

2863.014 

±0.002 

2865.093 

±0.002 

0.089 

±0.003 

G 

26 

Single 

Lorentz 

Se/Si = 3.0186 
II = 0.104 

2.8423 

5.275 

±0.005 

0.0006 

±0.0007 

0.187 

±0.003 

0.577 

±0.010 

0.091 

±0.002 

2863.014 

±0.002 

2865.093 

±0.002 

o.io4 

II 

17 

Single 

Lorentz 

Hone 

2.8389 

5.276 

±0.005 

0.0006 

±0.0007 

0.185 

±o.oo4 

0.559 

±0.027 

0.095 

±0.006 

2863.014 

±0.002 

2865.093 

±0.002 

0.089 

±0.003 


«S 2 /Si. = ratio of the band intensities of ]I 3E C1 and II 37 C1 
^700 Torr IJa pressure-broadened width 























































Table If. Comparison of HC1 P{9) line parameters retrieved by using the single and double precision versions of BMDP3R for Lorentz and Voigt shapes 


Exp 

Iterations 

Precision 

Line 

Shape 

Constraints 

Residual 

A± 

Az 

(cm) 

SlU/if 

(era -1 ) 

SsU/it 

(cm -1 ) 

«i, § 
(cm" 1 ) 

Vi, 

(cm -1 ) 

vs, 

(cm -1 ) 

II 

(cm -1 ) 

I 

7 

Double 

Voigt 

fe/Si* = 3.0186 
It = 0.104 

1.6440 

5.311 

±0.005 

0.001 

±0.001 

0.0247 

±0.0017 

0.O76 

±0.005 

o.oi4 

±0.002 

2675.952 

±0,003 

2677.728 

±0,002 

0.104 

J 

9 

Double 

Voigt 

None 

1.7753 

5.312 

10.005 

0.002 

±0.001 

0.0213 

±0.0010 

0.058 

±0.005 

0.023 

±0.005 

2675.952 

±0.003 

2677.728 

±0.002 

0.084 

±0.008 

K 

7 

Double 

Lorentz 

S a /Si = 3.0166 
II = O.lOt 

1.8453 

5.3U 

±0.005 

0,001 

±0.001 

0.0248 

±0.0018 

0.077 

±0.006 

o.oi4 

±0.002 

2675.952 

±0.003 

2677.728 

±0.002 

o.io4 

L 

9 

Double 

Lorentz 

Hone 

1.7751 

5.312 

±0.005 

0.002 

±0.001 

0.0213 

±0.0010 

0.058 

±0.005 

0.023 

±0.005 

2675.952 

±0.003 

2677.728 

±0.002 

0.084 

±0.008 

M 

9 

Single 

Voigt 

S 2 /Sl = 3.0186 
It = 0.104 

1.8447 

5.3U 

±0.005 

0,001 

±0.001 

0.0247 

±0.0017 

0.076 

±0,005 

o,oi4 

±0,002 

2675.952 

±0.003 

2677.728 

±0.002 

o.io4 

H 

10 

Single 

Voigt 

Hone 

1.7752 

5.312 

±0.005 

0.002 

±0.001 

0.0213 

±0.0010 

0.058 

±0.005 

0.023 

±0,005 

2675.952 

±0,003 

2677.728 

±0.002 

0.084 

±0.008 

0 

9 

Single 

Lorentz 

Sa/Si ~ 3. 0106 
II = 0.104 

1.8452 

5.311 

±0.005 

0.001 

±0.001 

0.0248 

±0.0018 

0.077 

±0,006 

o.oi4 

±0.002 

2675.952 

±0,003 

2677.728 

±0.002 

o.io4 

P 

10 

Single 

Lorentz 

None 

1.7750 

5.312 

±0.005 

0.002 
1 ±0.001 

0.0213 

±a.ooio 

0.058 

±0.005 

0.023 

40.005 

2675.952 

±0.003 

2677.728 

±0,002 

0.084 

10.008 


KSa/Sj. = ratio of band intensities of 1I 3S C1 and II 37 Cl 
700 Torr N 2 pressure-broadened width 



Table 5. Comparison of 1IC1 P(l) line parameters retrieved by using BMDPAR and BHDP3R in double preciaion for Voigt shape 


Program 

Constraints 

Iterations 

Residual 

Ax 

Aa 

(cm) 

Sj.ll/it 

(cm’ 1 ) 

s 2 u/« 

(cm- 1 ) 

! 0 L * 
(cm" 1 ) 

v l 

( cm -1 ) 

V 3 

( cm" 1 ) 

H 

(cm -1 ) 

BWDP3R 

S 2 /S 1 = 3.0816 
II = 0.104 

9 

2.8427 

5.275 

±0.005 

0.0006 

±0.0007 

0.187 

±0.003 

0.577 

±0.010 

1 

j 0.091 
±0.002 

2863.014 

±0,002 

2865.093 

±0.002 

0.104 

BMDPAR 

S?/Si = 3.0816 
ii = o.io4 

39 

2.8647 

5.277 

±0,005 

-0.0006 

±0.0001 

0.188 

±0.004 

0.570 

±0.016 

0,092 

±0.003 

2863.014 

±0.002 

2865.093 

±0.001 

0 io4 

BHDP3R 

None 

12 

2.8390 

5.276 

±0.005 

0.0006 

±0.0007 

0.I86 

±0.003 

0.559 

±0.027 

0.095 

±0.006 

2863.014 

±0,002 

2865.093 

±0.002 

: 

0.089 
±0 003 

BMDPAR 

Bone 

37 

2.8393 

5.276 

±0.005 

0.0006 

±0.000? 

0.186 

±0.003 

0.559 

±0.022 

0.095 

±0.005 

2863.014 

±0.002 

2865.093 

±0.002 

0 089 
±0,002 


*700 Torr II 2 pressure-broadened width 



Fig. 1 


Fig. 2 


Fig. 3 


Fig. ’4 


Fig. 5 


- Noise- free spectra and absorption coefficients of Voigt shaped 
lines calculated for the parameter values in Table 1. Top 
curve: Region I line; middle curve: Region II line; bottom 

curve: Region III line. 


- The top curve shows the spectrum with 1$ added noise calculated 
from the parameter values of Experiment 2a in Table 2, the next 
curve is the spectrum calculated from the retrieved values. The 
bottom curves show the ratio and differences of these spectra. 


- The top curve shows the spectrum with 1% added noise calculated 
from the parameter values of Experiment 3 in Table 2, the next 
curve is the spectrum calculated from the retrieved values. The 
bottom curves show the ratio and difference of these spectra. 


- The top curve shows part of an experimental spectrum near the 
P(9) lines of the 1-0 band of HC1 for a sample containing 
0.4-21 atm-cm (296 K) of HC1 with N2 added to give a total pres- 
sure of 700 Torr. Beneath it is a spectrum calculated from the 
retrieved values of the parameters given in Table 4. The bottom 
curve shows the ratio and difference of these spectra. 2800 cm -1 
should be added to the abscissa values. 


- The ranges of the values of the Voigt parameters x and y corre- 
sponding to the line parameters in Tables 1-5. The boundaries 
of the three regions defined by Pierluissi et al, are also shown. 
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APPENDIX I 


SUBROUTINE EOR USE WITH BMDP3R TO RETRIEVE 
THE PARAMETERS OP TEN VOIGT SHAPED LINES 



// TIME=IZ,REGION=500K 

//CMP EXEC PGM = IFEAA3,PARM=ID,TIME = (0,.5O) 

//SYSLIN DO UNIT— SYSDA»DSNAME=S £LQEJ,SPACE=(CYL,(1,1) ) , 

// DISP=( MOO, PASS) ,0C3=(RGCFM=F6,LRECL=e0,8L<SIZE=3L20) 

//SYSPRINT DO SYSOUT =A , DC3 =3UFNQ = 1 
//CMP.SYSIN 00 * 

SUBROUTINE FUN<F,DF. P , X , N,K ASE,NVA R ,NPAR , IPA SS ,XLGSS ) 

R£AL*3 OF(NPAR),P{NPAR) ,X(NVAR) 

REALMS 8(3H)»ST(32J»CMf32),X3,0QP(.2) , CONST ( 2) , PC 1 , PC2 ,PCC1 , PCC2 , 
AOACKf YEUt 2) ,U(2) ,V(2> , S ( 2) , T( 2 ) ,USeR < 2) ,VSER ( 2 ) ♦ UN( 2 ) ,VN f 2 ) ,U2 { 2 ) , 
*V2(2),UNEw(2) ,VNEW(2 ) ,0E{2) ,01{2),Q2(2) ,G1 (2 ) ,G2 < 2) , VOIGT{ 2) , 
*Rl(2>,URl{2),UR2l2),R0£R(2),UDER{2),R(2>»E<2),Gt2) ,H( 2) ,AKU<32) , 
ATM (32)»RR1»DFF(S) 

C0MPLEX*16 Z2(2),ZA1(Z) ,ZA3 <2),ZA5<2), 

AZ0ER2 ( 2 ) » ZDER3 C 2 J , ZB 1 ( 2 ) ,ZB 3< 2 ) 

REALA3 AN ( 30) /1 . 00000000,— . 33333333 ». 10000000*— . 2 3309 5 233 £—1 » 

A 4.629o2963£— 3*— 7 .57 5757 5 YE— 4, 1 .066376068E— 4,— L .322751323E-5 , 
a 1. 4539 1690E-6, -1.4 503 852226-7,1. 3 122532965-8,-1. 089222 104E-°, 
a 3 -350702795E— 11 »— 5.947794014E— 12,3.955429516E— 13 , 

A — 2.4o6c2701E— 14, 1.446326464E— 15, —3 . 03 273 50 1 2 E— 17 , 

A 4.2214072E9E— 18 ,-2. 107S55 19 IE-19 , 
a 1 -00251 64 Q 4£— 20 »— 4. 55 184675° E— 22 , 1.97706 4754 E— 23, 
*-8.230I49299E~25, 3. 2 392603495-26,-1 .264107899E-27»4.67843352E-29, 

A -1-66976 1793E-30 ,5. 75419 1644£-32,-l. 9 169428625-33/ , 

A Al/. 46131350/, 

* A2/-19016350/,A3/ r .099‘>9216/,A4/l. 73449270/, £5/. 002333394/, 

a A6/5. 52534370/, Bl/. 512424 24/, 32/. 27525510/, 83/. 05176536/, 
a BA/2 .72474500/, PIS 3 /I .128379167/ , PI SORT/ 1 .77245335091/ 

C FIND LINE PARAMETERS AND BACKGROUND USING VOIGT PROFILE ANO 

C TRIANGULAR SLIT FUNCTION 

C F=8 ACKGROUNDACONVC LVED TRANSMITTANCE 

C 8ACK=3ACKGROUND=( A1+A2A(X( 1 )-X0 ) ) 

C P<1)=A1 

C P ( 2 )=A2 

C P(3) = (LINESfl INTENSITY) A(CONCENTRATIONAPATH LENGTH )/P I 

C P (41 = ( LINE(#2 INTENSITY)A(CONCENTRATIQN*PATH LENGTH)/PI 

C P( 5)=( LINE# I LQRENTZI AN HALF WIDTH )=(LINE¥2 LORENTZIAN HALF WIDTH) 

C P(6)=WAV6NUM8ER OF LINE CENTER OF LINES1 

C P(7)=WAVENUM9ER OF LINE CENTER OF LINES2 

C P(8)=SLIT FUNCTION FULL HALF WIDTH 

C OOP (I)=OOPPLER HALF WIDTH OF LINE# I 

C D0 p (2)=00PPLER HALF WIDTH OF LINE22 

C CONST f 1 )=SQRT f LNf 2 ) ) /OOP { 1 ) 

C CONST (2)=S3RT( LN( 2) ) /0QP(2) 

X0=76. 343000 
OOP(1)=0. 2675490-2 
DOP (2 )=0. 27505 70-2 
CONST ( I )=3. 111731002 
CONST {2 ) =3 .026840002 
PC1=P (3 ) *CONST (1) 

PC2=P (4)* C0NST12 ) 

PCC1=PC1*CGNST( 1 ) 

PCC2=PC2*CQNST ( 2 ) 

8AC<=(P<l)+P<2)*(X(l)-X0) ) 

F=0.00 
00 1 1=1,8 
OF ( I ) =0.00 
1 CONTINUE 

DQ 2 K=l,32 

0 (K)= 1 33.00—2*0/32. DO 
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2 CONTINUE 

00 3 1=1,31,2 

N=( 1+11/2 

ST { N 1 =1/32 .DO 

ST (N+16>={ 32.00-1 1/32. DO 

3 CONTINUE 

00 4 <=1,32 

CM(K.) = (X(I)+8(K.)*P(8) 1 

YEU ( 1 )=OABS(CM(K)-P( 6) 1 

YEU( 2 )=OABS t CM (K 1 — P (7) I 

DO 5 J=l,2 

Ut J)=Y£U( 4)*CGNSTC4) 

V(4)=P(5) *CONST( J 1 
S(4)=UU)*U<4)-Vt4)*V(4) 

T(J)=2.D0*U< J1*V( J) 

22 ( J ) =OCMPLX { S ( 4 > , T{ 4 ) ) 

IF(V( Jl.EC.O.ODO) GO TO 6 

IF(V(J).GE.5.0DO.OR.U<4).G£.5.CD0) GO TO 7 
IF(VCJ).G=.I.800.0R.U(4).GE.3.000) GC TO S 
6 CONTINUE 

USER < J ) =V ( 4 ) 

VSER ( J ) =— U( J ) 

UN ( 4 1 =V t 4 ) 

VN t J )=— U ( J ) 

U2( J ) =— S ( J ) 

V2 ( 4 ) =— T { J ) 

M=6 . 3 42Q0*U ( J 1 +3 . ODO 
IF ( M.GT .29) M=29 
1F(U(J) .EC. 0.000) M=15 
DO 9 19=1, M 

UNEW< J!=UN( J)*U2( 4>-VN( J )*V2(4 ) 

VNEW< 4>=V2(4)*UN(J)+VN(J)*U2(4> 

USER ( J J =USER 1 4 ) +UNEW ( J ) *AN (19+1) 

VS ER ( 4 > =VSER { 4 )+VNEW ( 4 ) *AN ( 19+1 ) 

UN { 4 ) =UN£W ( 4 ) 

VN (4 )=VNEW ( J ) 

9 CONTINUE 

0E(4)=DEXP(— SI4) 1 

31 (4 ) = ( U ( 4 ) *OCCS ( — T( 4 ) ) — V ( 4 )*DSIN( — T( 4 ) ) 1 
C2 ( J ) =( V ( 4 ) *OCOS ( — T( 4 ) ) +U( 4 )*DS IN ( -T< 4 ) ) ) 
Gl(4)=tl.D0-PISQ*USER(4) ) 

G2 (4)=PISQ*VSERt J ) 

VG IGT( 4 )=DE( J)=MD COS( -T(4 ) )*G1(J)+0SIN(-T( J))*G2(J) ) 
R1U1=(-2.D0)*0E( J)*(Q1(4)*G1(4)+Q2(4)*G2(4)) 
URl(4)=(-2.00)*0c< J)*(QKJ) *(-G2(4) > +Q2 { 4 ) *G1 ( 4 > 1 
UR2( 4 ) =PISQ 
RDER(4)=R1(4) 

UDER ( 4 ) =UR1 ( 4 ) +UR2 { 4 ) 

GO TO 5 

3 R( 4 )=T ( J) *T ( 4 ) 

T(4)=T(4)*U(J) 

E(J)=SC4)-A6 
Gt4)=S(J)-A4 
H{4 )=S(4) — A2 
ZA114) =22 ( 4 1— A2 
ZA3 { 4 ) = 22 ( 4 1 — A4 
ZA5 ( J )=Z2 ( J )— A6 

2DER2 ( 4 1 =( A1/2A1 ( J J+A3/2A3 ( J ) +A 5/2 A 5( J ) )- 
* (2.D0*Z2<4) )*( Al/ZAK 4)**2+A3/ZA3(J)**2+A5/ZA5(4)**2) 

V0IGT(4)=A1*((T(4)-H(4)*V(4) } /( H(4 ) *H ( 4 ) +R ( 4) 1 ) + 
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* A3$((T(J)-GfJ)*V(jn/{6(J)*GtJ)+R<Jn) + 

=1= A 5* { ( T{ J ) -E (J ) *V (J ) ) / < E ( J ) *S U > +R { J >>) 

RDER t J ) =-0 IMAG t ZDER2 ( J > ) 

UDER ( J ) =DREAL( Z0ER2( J) > 

GO TO 5 

7 R(J)=T( J)*T( J) 

T( J)=T( 

E ( J ) = $ ( J ) -B2 
G( J ) =S { J) — 34 
ZBI{J)=Z2(J)-B2 
ZB3<J)=Z2< Jl-84 

ZDER31J )= ( 8 1/ZB l ( J >+33/Z33 ( J) )- 

* (2.D0*Z2(J) )*<31/Z91( J)**2+B3/ZB3<J)+*2) 

VQIGT{J) = 31F((T(J)— £{J)*V{J)}/(E(J)=*E(J)+R{J) ) )+ 

* 83*((T[J)-G(J)*V(J))/{G(J)4G(JJ+R(J) )) 

ROER ( J ) =-OIMAG ( ZOER3 ( J > } 

UDER { J ) =OREAL( Z0ER3( J ) ) 

5 CONTINUE 

AKU { K ) = C PC1*VQ IGT{ 1) +PC2*VOIGT{ 2) ) *PI SORT 
TM{K>=OEXPC-AKU(K) ) 

RR1=BAC.<*ST (K )^*TM (K)/16.Q0 

OFF ( 3 )=— VOIGT ( 1) 4C0NST ( 1 JFPISQRT 

DFF{ 4)=— VQIGTt 2) FCGNST( 2 )*P I53RT 

DFF{ 5 ) = { PCC1*UDER. 1 1 ) +PCC2*UDER ( 2 ) ) *P I SORT 

IF{C'MK).EQ.P(6) ) GO TO 14 

IP<C'KKKLT.P(6) ) GO TO 11 

DFF(6)=PCC1*RD£R< 1)*PISQRT 

GO TO 12 

11 3FF(6)=-PCC1*RDER(1)*PISQRT 
GO TO 12 

14 DFF(6)=0.000 

12 CONTINUE 

I F ( CM ( K } . EQ . P ( 7 ) ) GO TO 18 
IF{CM(.O.LT.P(7) ) GO TO 15 
OFF( 7 J=PCC2*R0ER( 2)*PISQRT 
GO TO 16 

15 0FF<7)=-PCC2*RDER(2)*PISCRT 
GO TO 16 

18 OFF( 7 )=0. ODO 

16 CONTINUE 

DPP( 8 ) =— ( OFF (6 )+OFF( 7) )*6 (K ) 

F=F+RR1 

DF{l)=OF(l) +ST (K ) FTM (K ) /16. DO 
DP<2)=OF<2) + ST(K)*TM(K)/16.00F(X(1>-Xa) 

DF(3)=OF(3)+RRl*DFF{3) 

DF<4)=OP{4)+RRl*OFF(4) 

OF ( 5 )=QFl 5 J+RR1*0FF( 5 3 
DF16)=DF{6)+RR1*0FF{6) 

DF(7I=DFt7)+RRlFDFF(7) 

DF(8)=DF(8)+RR1*0FF{8> 

4 CONTINUE 
RETURN 
END 

//GO EXEC PGM=LOADER,PARM=*MAP,EP=MAINSCONO=(S,LT),TIME=( 11,30) 

//GO.FTOlFOOl DO UNIT=SYSDA,SPACE = (CYL,( 1,1 ) ) , 

// 0CS=(RECFM=VBS T LRECL=324,BLKSIZE=2298) 

//FT02F001 00 UNIT=SYSOA r SPACE=(CYL,ll,l) ) 

//FT05F001 DO ODNAME^SYSIN 

//FT06F001 DO SYS0UT=A,DC3={RECFH=FBA, LRECL=133,PLKSIZE=931,3UFN0=1 ) 

//SYSLIB OD 0SNAME=SYS1.F0RT!_IB,DISP=SHR 
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//GG.SYSLIN DD DSN=L£03 J, 01 SP = (OLD ,0 ELETE } 

// 30 OSNAME=SYS4.3MOPLIS(3MDP3R),QISP=Sbft 

//SYSLOUT DO SYSOUT=A,DCd=BUFNG=l 
//GO.SYSIN □ D * 

PROS TITLE IS ‘TEST OF VCIGT LINE*./ 

INPUT VARIABLES ARE 2. 

FORMAT IS * (FI0.4,5X,FI0.3) • . 

UNIT=10. 

CASE ARE 299./ 

VARIABLE NAMES ARE X,F./ 

REGRESSION TITLE IS ‘LINE PARAMETERS GF DOUBLE VOIGT LINE ' . 
OEPENO=F. 

T0L=0. 00000001. 

PARM=8. 

C0NST=2. 

ITER=45. 

HALF=20 ./ 

PARM INIT=5. 24-+, 0.0, 0.0 19, 0.0617, 0.02,75. 95 ,77. 7, 0.103 
MAX I MUMS ARE 6.3,0.1,0.3,0.6,0.14,77.0,79.0,0.9 
MINIMUMS ARE 4. 0,-0.1 , I .E-7,0 .00001 ,0. 0001 ,71 .0 ,73 .0, 0.000 I 
CQMST= ( 4) 1 , ( 3 ) —3 . 0S66 . K=0. 

CONST=( 3)1. K=0.104. 

NAMES=AI,A2,SU0VPI1,SU0VPI2,L0RENTZ,NEU1,NEU2,H./ 

END/ 

FINISH/ 

/ /GO • FT1QFQQ1 DO 05N=TS0744-.HCL .DATA ( LI N02 ) , DI SP=SHR 
// 
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APPENDIX II 


SUBROUTINE FOR USE WITH BMDPAR TO RETRIEVE 
THE PARAMETERS OF A SINGLE VOIGT LINE 


30 



// TIHE=6,REGI0N=5Q0K 

//CMP exec P6M=IFEAAB,PAKM=ID,TIH£=( 0,30) 

//S YSLIM 0D UNIT=SYSDA,DSNAME=S&08J,SPACE=<CYL»C1,1)), 

// OISP=CMaQ,PASS>,DCB=(RECFM=FB,LRECL=80,BLKSIZE=31-20} 

//SYSPRINT DO SYSaUT=A,DC3=3UFN0=l 
//CMP.SYSIN 00 * 

SUBROUTINE FUN < F , P,X ,N,KASE »NVAR ,NPAR ,IPASS , XLOSS ) 

REAL*8 PINPAR) ,XCNVAR) 

REAL*B B(32)»STl32)»CM(32)TX0,DaP,C0NST,PCl»PCCl»BACK,YEU,U»V»S»T, 

* USER,VSER,UN, VN,U2,V2,UNEW» VN£W,DE,G1,G2, VOIGT, Rl, 

* R,E,H,G,AKU(32),TM(32),RR1 

REALMS AN! 30)/l. 00000000,-. 33333333, i 10000000,-. 238095238E-1, 

* 4.62962963E— 3»— 7 .57575757 E— 4, 1 .068376068E— 4, — 1 .322751323E-5 , 

* 1.4589169QE— 6,— 1.450385222E— 7 » 1.312253296E— 8*— 1.089222104E— 9, 

* 3.3 50702795 E-1 1,-5. 94779401 4E-12, 3. 95542 9516 E-l 3 , 

* — 2.466327QIE— 14, 1.448326464E— 15, —8. 032735012E— 17 , 

* 4. 2214072S9E— 18,— 2.107855 19 IE— 19, 

* 1.002516494E— 20,— 4. 55 1846759 E— 22, 1. 977064754E— 23, 

4— 8.230149299E— 25,3.2S9260349E— 26,— 1.264107899E— 27,4.67848352E— 29, 

* -1. 66976 1793E-30,5.754191644E-32,-1.916942862E-33/, 

* Al/ .46131350/, 

* A2/. 19016350/, A3/. 09999216/, A4/1. 78449 270/, A5/. 002883394/, 

* A6/5. 52534370/ ,B I/. 51242424/, B2/. 27525510/, 83/. 05176536/, 

* B4/2. 72474500/, PISQ/1. 128379167/, PISQRT/1. 77245385091/ 

C FINO LINE PARAMETERS AND BACKGROUND USING VOIGT PROFILE ANO 

C TRIANGULAR SLIT FUNCTION 

C F=BACKGROUND*CONVOLVEO TRANSMITTANCE 

C BACK=8ACXGR0UND=(Al-»-A2*<X(l>-X0)> 

C PU)=A1 

C P<2)=A2 

C P(3)= (LINE INTENSITY )*(CONCENTRATIQN*PATH LENGTH) /PI 

C P(4)=L0RENTZIAN LINE HALF WIDTH 

C PI5)=WAVENUHBER OF LINE CENTER 

C P(6)=SLIT FUNCTION FULL HALF WIDTH 

C OQP=QOPPLER HALF WIDTH OF THE LINE 

C CONST=SQRT(LN(2))/DOP 

X0=0.7320D0 
D0P=0.275D— 2 
CONST=3.02747I313D02 
PC1=P(3)*CQNST 
PCC1=PC1*C0NST 
8ACK=(P(1)+P(2)*IXU )-X0) ) 

F=O.DO 

00 2 K=l,32 

B(K) = (33.DO-2*IU/32.DO 

2 CONTINUE 

00 3 1=1,31,2 

N=U+l)/2 

ST lN)=I/32. DO 

ST CN+16 >=< 32.D0-I >/32.D0 

3 CONTINUE 

DO 4 K=l,32 

CM(K)=(X(1)+B(K)*P(6) ) 

YEU=DA8S( CHIK.)— PIS ) ) 

U=YEU*CONST 

V=PI4)*C0NST 

S=U*U-V*V 

T=2.D0*U*V 

IFIV.EQ. 0.000) GO TO 6 
IFIV.GE.5.0D0.0R.U.GE.5.0D0) GO TO 7 
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IFtV. GE. 1.800. OR. U.GE.3.000J GO TO 8 

6 CONTINUE 
USER-V 
VSER=~U 
UN=V 
VN=-U 
U2=-S 

v2=-r 

M=6.842D0*U+8.000 
IFtH.GT.29) M=29 
IF(U.EQ.O.ODO) M=15 
DO 9 I9=I,M 
UNEW=UN*U2-VN*V2 
VNEW=V2*UN+VN*U2 
USER=U5ER+UNEW*AN( 19+1) 

VS£R=VSER-»-VNEW*AN< I9+X) 

UN=UNEW 
VN=VNEW 
9 CONTINUE 

OE=DEXP ( -S) 

Gl=(l.DO-PISQ*USER) 

GZ=PISO*VSER 

VOIGT =D£* { OCOS (— T ) *GX+DSIN ( — T ) *G2 ) 

GO TO 5 
3 R=T*T 
T=T*U 
E=S-A6 
G=S-A4 
H=S— A2 

VOIGT =A1*{ (T-H*V)/(H*H+R))+A3*( CT-G*V >/ (G*G+R) )+ 

* A5* ( ( T— £*V ) /( E*E+R ) ) 

GO TO 5 

7 R=T*T 
T=T*U 
E=S-B2 
G=S-34 

VQIGT-B1*C tT-£*V>/(E*E+R)>+B3*( <T-G*V)/tG*G+R) > 

5 CONTINUE 

AKUtK )=PC1*VQIGT*PIS3RT 
TM(K)=QEXP(-AKUtK) ) 

RRl=aACK*STtK)*THtKJ/I6.00 
F=F+RR1 
A- CONTINUE 
RETURN 
END 

//GO EXEC PGM=L0A0ER,PARK=»MAP t EP=MAIN»,C0N0=t8,LT),TIME=(5f30) 

//FTOIFOOI 00 UNIT=SYSDA,SPAC£=<CYL,(X»X) ) 

//FTO2FO0I DO UNIT=SYSDA,SPACE=(CYL»tl,I) )' 

//FT05F00I OD DDNAHE=SYSIN 

//FT06F00X 00 SYSOUT=A,DCB=( RECFH=FBA» LReCL=133,BLKSIZE=931,BUFN0=l ) 

//SYSUB 00 OSNAHE=SYSX.FORTLlB,DISP=SHR 
//SYSLIN 00 DSNAH6=Ce0BJ,0ISP=t0L0,0ELETE) 

// 00 OSNAM£=SYS4.BMDPLIBCBMOPAR),OISP=SHR 

//SYSLOUT 00 SYSQUT— AfOCB— SUFNQ^X 

//GO.SYSIN DO * 

/PROBLEM TITLE IS 'TEST*. 

/INPUT VARIABLES ARE 2. 

FORMAT IS »{F10.4,5X,F10.7) *. 

UNIT=iO. 

CASE ARE 50. 
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/VARIABLE NAMES ARE X,F. 

/REGR TITLE IS 'LINE PARAMETER OF SINGLE VOIGT LINE'. 
OEPEND=F. 

TO L=0 .OOOOOOOl. 

PARM=6. 

ITER=45. 

HALF=20. 

/PARAM INIT=5. 258, 0.0,0. 0075, 0.0046, 0.729, 0.0152 

MAX ARE 6.0,0.1,0.120,0.0990,0.9320,0.9 
MIN ARE 4. 0,-0. 1,1. E-7,0.00001, 0.3320, 0.00001 
NAME=Ai , A2,SU0VRP I,LORENTZ.» NEUO ,H. 

/END 

/FINISH 

//G0.FT1QFQO1 00 f>SN=TS0744.HCL.0ATA CANN J ,OISP=SHR 
// 
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RA9 . 

SPECTRAL ANALYSIS I. EXPERIMENTAL DESIGN 
E. NIPLE , J. H. SHAW 

The intensities, positions and widths of spectral lines can be obtained from absorption 
spectra provided the physical conditions of the sample (temperature, path length, etc.) are 
known. ' 

The accuracy with which these line parameters can be retrieved depends on many factors. 

An Information Theory approach is adopted to make these dependences explicit for the case 
of an isolated, pressure-broadened line. 


RA10. 

SPECTRAL ANALYSIS II. SMOOTHING AND INTERPOLATION OF SPECTRAL DATA 
M. HOKE , R. HAWKINS, J. H. SHAW 

The parameters of individual spectral lines have been successfully retrieved by non- 
linear regression analysis of experimental data. (1) 

The effect on these parameter values due to incorrect modeling (e.g. improper choice of the 
number of lines present) or of prior smoothing or interpolation of the raw data have been 
investigated by analyzing synthetic spectra with added noise. 

Examples of the effects studied will be presented. 

L. Y. Chang, J. Shaw, Appl. Spectros. 31 . 213 (1977). 


RA11. 

ANALYSIS OF SPECTRA III. VOIGT SHAPED LINES 
C. L. LIN AND J. H. SHAW 

Pressure broadening of spectral lines can cause fine details in the absorption spectra of 
gases to be lost. As the pressure is decreased the details are observed but the line shape 
typically changes from a Lorentz shape to the Voigt shape. It is shown that, provided the 
physical conditions of the sample are known, the line intensity, position and Lorentz width 
can be determined by spectral curve fitting for a wide variety of Voigt shaped lines. The 
technique of Pierluissi et al. (1) was used to calculate the absorption coefficients of the 
lines . 


1. J. H. Pierluissi, P. C. Vanderwood, and R. B. Gomez, JQSRT,_18,555 (1977). 
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RA12. 

SPECTRAL ANALYSIS IV. WHOLE BAND ANALYSIS OF ROTATION-VIBRATION BANDS OF DIATOMIC GASES 


J. H. SHAW , C. L. LIN 

Absorption speeds recorded by Fourier Transform Spectrometers consist of the transmitted 
signals at discrete frequencies. Spectra of the 1-0 bands of H^ci and H-^Cl observed at a 1 
nominal resolution of 0.1 cnT* contain more than 8000 signal values. 

The individual line positions, intensities and (Lorentz) widths can be described by proper 
choice of 23 adjustable parameters in the expressions for the upper and lower state term 
values, the band intensities, Herman-Wallis factors, and the J dependence of the line widths. 
The observed spectrum can be fitted by synthetic spectra calculated from these expressions 
provided the shape (assumed triangular) and width of the spectral response function of the 
instrument and the shape (assumed to have a quadratic dependence on frequency) and position of 
the background curve can be determined. 

i 

Values for the 27 unknown parameters of these models have been simultaneously determined » 
by least squares fitting more than 3500 signal values within ±5 cm~l of the HCL line centers 
in a single spectrum. The standard deviations of the values and the correlations between the 
parameters were also determined. The results are compared with previously estimated values. 

This method of band analysis requires models for all the factors which influence the 
spectrum and large amounts of computer time, but no experimenter manipulation of the raw data. 


RA13. 

SPECTRAL ANALYSIS V. WHOLE-BAND ANALYSIS OF ROTATION-VIBRATION BANDS OF LINEAR MOLECULES - N 2 0 
- R. HAWKINS , M HOKE, J. H. SHAW 

The whole-band analysis technique described by Shaw and Lin (1) is being applied to N 2 O 
absorption bands. Progress in applying this technique to N 2 O spectra obtained with a Fourier 
Transform Spectrometer is described. 


1. J. H. Shaw, C. L. Lin (preceding paper in this session) 



